Virial inversion and density functionals
Sabine Jansen, Tobias Kuna, Dimitrios Tsagkarogiannis

TL;DR
This paper introduces a new mathematical inversion theorem for functionals in infinite-dimensional spaces, with applications to density function theory and improved convergence estimates for the virial expansion in statistical mechanics.
Contribution
It develops a novel inversion method using fixed point equations and combinatorial identities, enhancing convergence analysis for density functionals and the virial expansion.
Findings
Proves a new inversion theorem for power series functionals.
Provides rigorous convergence framework for inhomogeneous systems.
Achieves improved radius of convergence for the virial expansion of the hard sphere gas.
Abstract
We prove a novel inversion theorem for functionals given as power series in infinite-dimensional spaces and apply it to the inversion of the density-activity relation for inhomogeneous systems. This provides a rigorous framework to prove convergence for density functionals for inhomogeneous systems with applications in classical density function theory, liquid crystals, molecules with various shapes or other internal degrees of freedom. The key technical tool is the representation of the inverse via a fixed point equation and a combinatorial identity for trees, which allows us to obtain convergence estimates in situations where Banach inversion fails. Moreover, the new method for the inversion gives for the (homogeneous) hard sphere gas a significantly improved radius of convergence for the virial expansion improving the first and up to now best result by Lebowitz and Penrose (1964).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Virial inversion and density functionals
Sabine Jansen
Mathematisches Institut, Ludwig-Maximilians-Universität, 80333 München, Germany
,
Tobias Kuna
Department of Mathematics and Statistics, University of Reading, Reading RG6 6AX, UK
and
Dimitrios Tsagkarogiannis
Dipartimento di Ingegneria e Scienze dell’Informazione e Matematica, Università degli Studi dell’Aquila, 67100 L’Aquila, Italy
(Date: 30 August 2019)
Abstract.
We prove a novel inversion theorem for functionals given as power series in infinite-dimensional spaces and apply it to the inversion of the density-activity relation for inhomogeneous systems. This provides a rigorous framework to prove convergence for density functionals for inhomogeneous systems with applications in classical density function theory, liquid crystals, molecules with various shapes or other internal degrees of freedom. The key technical tool is the representation of the inverse via a fixed point equation and a combinatorial identity for trees, which allows us to obtain convergence estimates in situations where Banach inversion fails. Moreover, the new method for the inversion gives for the (homogeneous) hard sphere gas a significantly improved radius of convergence for the virial expansion improving the first and up to now best result by Lebowitz and Penrose (1964).
Keywords: cluster and virial expansions – density functional theory – holomorphic functions in Banach spaces
MSC 2010 classification: 82B05, 82D15, 82D30, 47J07, 05C05
Contents
1. Introduction
Deriving functional expressions for thermodynamic quantities from microscopic models which are based on physical principles is one of the main challenges of both theoretical and computational methods in statistical mechanics. Furthermore, the use of such functionals is ubiquitous in applied mathematics for example in classical density function theory, liquid crystals, heterogenous materials, colloid systems, system of molecules with various shapes or other internal degrees of freedom. However, often the key point in variational calculus and the theory of PDE is to consider non constant densities and hence non translation invariant systems. One key mathematically rigorous result in this direction was the proof of the convergence of the virial expansion by Lebowitz and Penrose in 1964 [LP64], building on the previously established convergence of the activity expansion of the pressure and of the density. The proof consists out of three main steps: first to invert the density-activity relation, second to plug the resulting expansion of the activity as a function of the density into the pressure-activity expansion and resum, and finally bound the radius of convergence of the composed power series combining convergence results for the inversions and for activity expansions. Previous results [MGM40], based on formal manipulations of power series and combinatorics of graphs, had already identified the coefficients in the density series in terms of two-connected (“irreducible”) graphs. A by-product of the convergence result from [LP64] is the absolute convergence of the generating function for two-connected graphs, thus justifying formulas that were already in use.
This recipe for going from activity expansions to density expansions extends to quantities whose activity expansion is well understood, for example, the truncated correlation functions. However convergence proofs for other quantities are more delicate, as explained in detail in [KT18] for the direct correlation functions. Indeed, even though combinatorial series for various quantities are available, their derivation rests on formal manipulations and graph re-summations that have yet to be rigorously justified. The formal graph re-summations were developed in the 60’s mainly by the works of Morita and Hiroike [MH60, MH61] and of Stell [Ste64] on liquid state theory expansions for inhomogeneous fluids, allowing for position-dependent densities. In contrast, the convergence result from [LP64] and all subsequent works addresses homogeneous systems only.
Our goal, therefore, is twofold:
- (1)
Establish the validity of the inversion formulas for inhomogeneous fluids. 2. (2)
Prove the validity of re-summation operations on graphs by showing that the resulting power series are absolutely convergent.
As far as goal (2) is concerned, in a previous work [KT18] we proved convergence for some resummation of expansions leading to graphs with higher connectivity properties, but starting from the canonical ensemble. That choice was made in order to avoid the graph re-summations that come with the inversion, but also since it is more natural for expansions with respect to the density. In the current paper we prove the validity of these re-summations by inverting the density-activity relation, but a similar structure may be expected for other types of resummations as considered, e.g. inverting the truncated correlation vs activity relation. We intend to address all these issues in a subsequent work.
Concerning goal (1), since inhomogeneous system can be seen as a system of uncountably many species, when one considers the position as species. We consider goal (1) in this more general context of a system of (potentially uncountably many) species. In this way, we can treat at the same time as well systems of mixtures as with internal degrees of freedom. This generalization will not increase the complexity of the arguments involved.
At first sight, it may look as if goal (1) is best achieved with the help of inverse function theorems in complex Banach spaces, applied to the functional that maps the activity profile to the density profile , see Section 2.2. This works well for inhomogeneous systemsof e.g. objects of bounded size, e.g., hard spheres of fixed radius. It turns out, however, that Banach inversion fails for mixtures of objects of finite but unlimited size [JTTU14, Jan15], see Example 2.7. As a way out, mixtures of countably many species were treated with the help of Lagrange-Good inversion in [JTTU14], leaving the case of uncountably many species wide open.
Our first main result is a novel inversion theorem (Theorem 2.5) that addresses the above-mentioned difficulties and bypasses both Banach and Lagrange-Good inversion. The novelty is two-fold. First, we work on the level of formal series and relate the formal inverse to generating functions of trees or equivalently, solutions of certain formal fixed point problems (Proposition 2.6). This part is inspired by the combinatorial proof of the Lagrange-Good formula for finitely many variables given in [Ges87], we will consider this relation in more details in a forthcoming work. Second, we provide sufficient conditions for the convergence of the formal inverse, i.e., of a generalized tree generating functions (Theorem 2.3). The inversion theorem is of an abstract general nature and has the potential of being applied to other situations than the density-activity relation in statistical mechanics.
In our second group of results (Section 3), we apply the abstract inversion theorem to the concrete problem of inverting the functional that maps the activity profile in an inhomogeneous grand-canonical Gibbs measure (or even a general multi-species system) to the density profile. We exhibit domains on which the activity profile is written as a convergent series in the density profile, relate the coefficients to two-connected graphs, and show that the virial expansion for the pressure as a functional of the position-dependent density profile converges and is indeed given in terms of two-connected graphs (Theorem 3.5). These results work for general stable pair potentials.
Finally in Section 4 we apply the results to different more concrete choices of pair potentials. We demonstrate the power of our approach for systems of homogeneous hard spheres, our results yield a significant improvement over previously available bounds (Theorem 4.1). For general non-negative potentials the improvement is almost 27%. For mixtures of thin rods with different orientiations, we obtain a series representation of the (grand-canonical) free energy as a function of the overall density of rods and the probability density on different orientations (Theorem 4.7 and Corollary 4.8). In fact, in an early work, Onsager [Ons49] derived a density functional for liquid crystals, keeping track of the orientation of the atomistic elongated molecules. Working in the canonical ensemble he discretized the space of orientations and assigned each value to a species obtaining a multi-species canonical partition function for (for finitely many) species. Following the new developments [PT12], the convergence of this expansion can be easily proved to be valid in the low density regime. Our result allows for a direct treatment of continuous values of the orientation as inhomogeneous systems. It bypasses the need to estimate errors from discretizing the orientation space, at the price of a detour through the grand-canonical ensemble. The improvements we are obtain are purely due to the improved inversion results as we used the classical tree-graph bound in the grand-canonical ensemble [Pen63], [Rue69],[MM91], [PU09], [PY17] and for marked systems [Kun01].
Following the above discussion we summarize below the main outcomes of this paper:
- (1)
Proof of a novel inversion theorem (Theorem 2.3), applicable to the inversion of the density-activity relation for inhomogeneous systems (Section 3), yielding a convergent power series of the inverse map. 2. (2)
Key technical tool: a fixed point equation for generating functions of special trees (Proposition 2.6). 3. (3)
Various applications: inhomogeneous gas, liquid crystals, molecules with various shapes (internal degrees of freedom), see Section 4. 4. (4)
Comparison to existing theorems of inversion in Banach spaces (Proposition 2.8 and Theorem 2.10). 5. (5)
Discussion of the improvement of the radius of convergence for the (homogeneous) hard sphere gas (Section 4.1).
2. General inversion theorems
2.1. Main inversion theorem with proof
Let be a measurable space and the set of -finite non-negative measures on . Further let be the set of complex linear combinations of measures in . When there is no risk of confusion, we shall write and for short. Suppose we are given a family of measurable functions , . We assume that each is symmetric in the ’s, i.e.,
[TABLE]
for all permutations . When we say that a power series converges absolutely, we mean that
[TABLE]
where is the total variation of 111If with mutually singular -finite non-negative measures, then . Let be the domain of convergence of the associated power series, that is if and only if the power series converges absolutely in the above sense. We set
[TABLE]
We are interested in maps of the form
[TABLE]
given by
[TABLE]
where is just a notation for . The latter is useful whenever one wants to stress the instead of the dependence. Thus is absolutely continuous with respect to with Radon-Nikodým derivative . We want to determine the inverse map ,
[TABLE]
Suppose for a moment that such an inverse map exists. Clearly is equivalent to with Radon-Nikodým derivative . Consequently we should have
[TABLE]
This observation is the starting point for our inversion result, namely the family of power series given by
[TABLE]
should solve
[TABLE]
and therefore
[TABLE]
In Proposition 2.6 below we provide a combinatorial interpretation of as the exponential generating function for colored rooted, labelled trees whose root is a ghost of color (i.e., the root does not come with powers of in the generating function). For our main inversion theorem, however, it is enough to know that the fixed point equation () determines the power series uniquely.
Lemma 2.1**.**
There exists a uniquely defined family of formal power series
[TABLE]
with measurable and symmetric in the ’s, that solves () in the sense of formal power series.
As the above expressions are interpreted in the sense of formal power series, neither the series need to converge nor the integrals need to exist.
Proof.
Set . Let be the coefficients of the series in the exponential in (), i.e., each is measurable, and we have
[TABLE]
in the sense of formal power series. Then
[TABLE]
see Eq. (A.8) in Appendix A. The third sum is over ordered partitions of , indexed by , into disjoint sets , with explicitly allowed. For example,
[TABLE]
More generally, depends on alone. This is the only aspect of (2.9) that enters the proof of this lemma.
For , let be the collection of set partitions of . The family solves () in the sense of formal power series if and only if for all and , we have
[TABLE]
see Eq. (A.7) in Appendix A. In particular,
[TABLE]
which determines and uniquely. A straightforward induction over , exploiting that the right-hand side of (2.10) depends on alone (via ,…, ), shows that the system of equations (2.10) has a unique solution . ∎
Remark 2.2*.*
The proof of Lemma 2.1 shows that the coefficients can be computed recursively.
Next we provide a sufficient condition for the absolute convergence of the series .
Theorem 2.3**.**
Let be the unique solution of () from Lemma 2.1. Assume that for some measurable function , the measure satisfies, for all ,
[TABLE]
Then, for all , we have that
[TABLE]
and the fixed point equation () holds true as an equality of absolutely convergent series.
Proof.
The inductive proof is similar to [Uel04, PU09]. Let , , be the partial sums for the left-hand side of (),
[TABLE]
We prove by induction on , building on the proof of Lemma 2.1. The estimate for the full series then follows by a passage to the limit .
For , we have and the inequality is trivial. Now assume . The triangle inequality applied to Eqs. (2.9) and (2.10) yields the same iterative formula for as for just with replaced by \bigl{|}A_{n}(q;x_{1},\ldots,x_{n})\bigr{|}. We noted before that, if we consider and hence only for , then on the right hand side only with appear. However, there are some terms on the right hand side, which as well only contain with but which come from some term on the left hand side for . Adding these missing terms, we reconstruct an exponential on the right hand side. As all of these additional terms are non-negative, we get the following inequality, instead of an equality
[TABLE]
The induction is complete. It follows that () holds true. In particular, the series is absolutely convergent and satisfies . By condition (), the right-hand side of the fixed point equation () is absolutely convergent as well. Therefore Eq. () holds true not only as an identity of formal power series but in fact as an identity of well-defined complex-valued functions. ∎
Remark 2.4*.*
For non-negative functions , the convergence estimate is sharp, in the following sense: If is a non-negative measure and , then there exists a function such that () holds true. Indeed, an induction over , based on Eqs. (2.9) and (2.10), shows that if the ’s are non-negative, then the coefficients and are non-negative as well. If , we may define
[TABLE]
Notice because of for non-negative and . It follows from () that the inequality () holds true and is in fact an equality. This was already noticed in [Jan18, Proposition 2.9] and the proof of Theorem 4.2(b) in [Jan15].
Now that we have addressed the convergence of the series , we may come back to the inversion of the map . For measurable , let
[TABLE]
For , define by
[TABLE]
Theorem 2.5**.**
For every weight function , there is a set such that is a bijection with inverse .
Proof.
Let be the image of under . By Theorem 2.3, the set is contained in , in particular if with , then is well-defined with
[TABLE]
For the last identity we have used the fixed point equation (). Thus we have checked that if , with , then . Conversely, if with , then by definition of there exists such that , hence and . ∎
Finally we provide a combinatorial formula for the function appearing in the inverse . Consider a genealogical tree that keeps track not only of mother-child relations, but also of groups of siblings born at the same time. This results in a tree for which children of a vertex are partitioned into cliques (singletons, twins, triplets, etc.). Accordingly for we define as the set of pairs consisting of:
- •
A tree with vertex set . The tree is considered rooted in [math] (the ancestor).
- •
For each vertex , a set partition of the set of children222The members of the partition are assumed to be non-empty, except we consider the partition of the empty set. of . If is a leaf (has no children), then we set .
For , we define the weight of an enriched tree as
[TABLE]
with . So the weight of an enriched tree is a product over all cliques of twins, triplets, etc., contributing each a weight that depends on the variables of the clique members and the variable of the parent.
Proposition 2.6**.**
The family of power series from Lemma 2.1 is given by
[TABLE]
Proof.
We check that the generating function of the weighted enriched trees satisfies (). Functional equations for generating functions of labelled trees are standard knowledge [BLL98], we provide a self-contained proof for the reader’s convenience. Define
[TABLE]
Further define but restricting the sum to enriched trees for which (all children of the root belong to the same clique). Further set and . For a finite non-empty set, define in the same way as but with replaced by . For we define and assign the empty tree the weight . For non-empty trees, weights are defined in complete analogy with (2.13).
Clearly there is a bijection between enriched trees and set partitions of together with enriched trees , for which all children of the root are in the same clique. Indeed, the number corresponds to the number of cliques in which the children of the root are divided and the blocks group descendants of the root, where contains the children of the root which are in the -th. clique and all their decedents. The weight of an enriched tree is equal to the product of the weights of the subtrees . Therefore
[TABLE]
Furthermore there is a one-to-one correspondence between, on the one hand, enriched trees where all the children of the root are in the same clique and on the other hand tuples consisting of non-empty set , an ordered partition of (with allowed), and a collection of enriched trees . Overall, and give a partition of . The set consists of the labels of the children of the root, that is the one clique which all these children form and for each , the set consists of the labels of the descendants of . ( means that is a leave of the tree) It follows that
[TABLE]
It follows from Eqs. (2.14) and (2.15) that the formal power series with coefficients solves (), therefore Lemma 2.1 yields . ∎
2.2. Scale of Banach spaces. Banach inversion
Formally, one is tempted to say that is given by a power series with leading order , hence differentiable with derivative at the origin given by the identity matrix; therefore the existence and regularity of the inverse map should follow from some general inverse function theorem. When is finite so that can be identified with a finite vector , with , this can be implemented and is indeed a standard ingredient for the virial expansion for single-species systems [LP64].
For infinite spaces one may try a Banach inversion theorem. This works in some cases (see Theorem 2.10 below), but there are situations where the Banach inversion theorem is doomed to fail, as illustrated by the following example. The example is inspired by concrete features of the multi-species Tonks model [Jan15] for rods of unbounded lengths .
Example 2.7*.*
Let and identify measures on with sequences . Consider the map given by
[TABLE]
Let be the space of bounded complex-valued sequences equipped with the supremum norm and the space of sequences with , for some fixed scalar . We may view as a map from the open ball to . The derivative is the identity map or more precisely, the embedding , . It is injective and continuous but it does not have a continuous inverse, therefore Banach inversion theorems are not applicable. The issue arises because the norms and are not equivalent. A target space with inequivalent norm is needed because, for every —no matter how small— as .
It turns out that the natural analytic framework for our inversion theorem uses not a single Banach space, but instead a scale of Banach spaces, as is the case for the Nash-Moser theorem [Ham82, Sec16]. We explain this aspect in more detail here as this clarifies the issues raised in [JTTU14, Section 2.2] and [Jan15, Theorem 2.8].
Let us fix a reference measure and we restrict to measures that are absolutely continuous with respect to . Remember that is absolutely continuous with respect to the measure , so if is absolutely continuous with respect to , then so is . We work with the Radon-Nikodým derivatives rather than the measures and write
[TABLE]
similarly for and . Fix a weight function and assume that satisfies condition (). Let be the space of bounded functions (precisely, equivalence classes up to -null sets), equipped with the supremum norm
[TABLE]
Write for open balls of radius centered at [math]. For measurable and , define the weighted supremum norm
[TABLE]
and let be the associated Banach space. Notice the inclusions
[TABLE]
When is essentially bounded, then the inclusions are equalities and the norms , are equivalent. For , the inclusions are strict and the norms are inequivalent. Let and be the open balls of radius , centered at the origin, in and , respectively.
Proposition 2.8**.**
Assume that satisfies condition (). Then the maps
[TABLE]
are holomorphic, as maps between the Banach spaces and . Moreover we have and for all and .
The proposition is proven at the end of this section. The inclusions and follow from the inequalities
[TABLE]
valid for all , , and all , assuming satisfies () by using Theorem 2.3. The difference to the previous results is that we show here uniform convergence of the power series expansions of and in the relevant norms.
We briefly check (2.16). If then for -almost all . Since satisfies condition (), it follows by Theorem 2.3 that the measure is in the domain of convergence of (though it does not fulfill condition ()) and , consequently . If , then, using again that satisfies condition (), we see that in this case the measure satisfies condition () as well and the bound () yields .
It is an immediate consequence of Proposition 2.8 that is a bijection from onto . If is essentially bounded, then all norms are equivalent, hence and are holomorphic as maps in and is open in the non-weighted sup norm . Moreover we have the inclusion
[TABLE]
and we obtain the following corollary.
Corollary 2.9**.**
Assume that satisfies condition () and in addition . Then maps some open subset of biholomorphically onto , and the inverse map is .
Corollary 2.9 points out a situation where Banach inversion does work, which raises the question whether a similar result can be obtained directly, bypassing the introduction of a weight function . This is indeed possible. Let us fix a reference measure as before but drop the requirement that satisfies (). Set
[TABLE]
and let
[TABLE]
Theorem 2.10** (Banach inversion).**
Assume that (2.17) holds true for some and let be as in (2.18). Let
[TABLE]
Then the functional maps some open neighborhood of the origin biholomorphically onto the open ball .
Proof.
The map is holomorphic. The proof of the holomorphicity is similar to the proof of Proposition 2.8 and therefore omitted. The derivative at the origin is the identity: . On , the map is bounded by . Therefore, by Theorem B.6, for each , the functional maps the open ball biholomorphically onto a domain covering . We optimize over and obtain the theorem. ∎
Remark 2.11*.*
Let us compare the radius of convergence of the inverse function which we obtained with the technique of Banach inversion theorem with the convergence results we obtain with the new inversion technique described in Section 2, namely Corollary 2.9. Let us call the radius of convergence in the latter case. We will show that and thus even in those situations where a direct application of Theorem B.6 is possible, it yields a bound that is worse than ours.
Let us first derive an expression of in terms of as defined in (2.17). If satisfies condition () with , then . Conversely, assume for some and consider constant weight functions . Then, for every , choosing small enough we may assume and then the rescaled measure satisfies condition (). Noting that
[TABLE]
we deduce from Corollary 2.9 that is contained in the domain of convergence of the density expansions. An optimization over and shows that the domain of convergence contains the open ball with radius
[TABLE]
Below we check that .
Proof of .
Let and . By definition of , there exists such that . Set . Then , thus and
[TABLE]
It follows that . Conversely, let . By definition of there exists such that , hence . Set , then , , and
[TABLE]
It follows that . We let and deduce . ∎
Proof of Proposition 2.8.
We only need to prove that the maps are holomorphic. Consider first the map . We have with
[TABLE]
and
[TABLE]
see Appendix A, Eq. (A.7). We show first that is holomorphic, by proving that the series (2.19) converges uniformly in the relevant operator norms. Set
[TABLE]
Then for all , we have
[TABLE]
because satisfies condition (). In particular, the power series has radius of convergence . It follows from Cauchy’s inequality for the Taylor coefficients of the series that for all ,
[TABLE]
Therefore, we can bound
[TABLE]
As a consequence, the map defined on given by
[TABLE]
satisfies for any (we will choose appropriately at the end)
[TABLE]
It follows from the polarization formulas, see e.g. [Muj06], that the associated multilinear map from to is bounded, whenever or , hence is a continuous -homogeneous polynomial (see Definition B.1). By (2.21), the series converges uniformly in . Therefore, the map as a map
[TABLE]
is holomorphic. For , it is also holomorphic as a map
[TABLE]
because and .
Now we return to . By (2.20), we have
[TABLE]
hence
[TABLE]
whenever and . In order to prove the differentiability, let us introduce
[TABLE]
which will be shown to be the derivative of . Using Cauchy’s inequality we get from the holomorphicity of that there exists a with , whenever . Then for we get that
[TABLE]
Thus is bounded. Let us show differentiability directly. Write
[TABLE]
which can be estimated as
[TABLE]
Hence is holomorphic on with values in for . Furthermore, is bounded by because of (2.24). The result is the extremal case .
The map is treated in a completely analogous way. We start from . Since we assume that satisfies condition (), we know from Theorem 2.3 that
[TABLE]
We can now repeat the reasoning for , substituting for , for , and the bound (2.25) for (2.20). ∎
2.3. An equivalent fixed point equation
In the proof of Lemma 3.9 in Section 3 we need another characterization of the coefficients .
Lemma 2.12**.**
The family from Lemma 2.1 is the unique family of formal power series that solves
[TABLE]
Eq. () reflects that while the fixed point equation (), defining , reflects that because .
Proof.
Let us write instead of as long as we do not know that the family from Lemma 2.1 satisfies (). For the existence and uniqueness of a solution to (), we note that Eq. () translates into a triangular system of equations for the coefficients . The details are similar to the proof of Lemma 2.1 and therefore omitted.
We start from (), written for ’s instead of ’s, and insert on both sides. This insertion corresponds precisely to the second notion of composition discussed in Appendix A, see Eq. (A.8), and in particular it is a well-defined operation on formal power series. The composition yields two formal power series in , one for the left and one for the right side, called and respectively, and of course we must have . On the right side we get, by (),
[TABLE]
On the left side we have
[TABLE]
The product inside the integral is equal to because of (), therefore and we conclude from that . In particular, solves (). ∎
3. Virial expansion. Density functional
In this section we are consider functions of a special form, cf. (3.8) below, which are coming from a system of objects interacting via a pair potential.
Let be a measurable pair potential (). We assume that for some measurable function , we have the stability condition
[TABLE]
for all and . In addition, we also assume that for all and some function we have
[TABLE]
Define
[TABLE]
for and , . Let us introduce for the next few calculation up to (3.6) an extra assumption on , namely
[TABLE]
In the case that and , respectively, is a translation invariant function, measure respectively, then the above condition means that the volume of with respect to the Lebesgue measure is finite. Hence we say that we are in the “finite volume” case. We will point out which formulas also hold in the “infinite volume” case.
The grand-canonical partition function at activity and inverse temperature is
[TABLE]
Condition (3.3) ensures that is finite. The one-particle density is
[TABLE]
Notice
[TABLE]
see Eqs. (A.4) and (A.5) in Appendix A applied to . We bring the expression for into the form (2.5). This allows us to extend the definition (3.5) to activities that do not satisfy the finite-volume condition (3.3). Set
[TABLE]
Let be the set of connected graphs with vertex set , and the edge set of a graph and
[TABLE]
The aim of the section is to use the result of the previous section for this particular . Furthermore, define the well-known Ursell functions
[TABLE]
Let us recall some known results.
Lemma 3.1**.**
Let be as in (3.8) and define as in (2.3). Let satisfy only
[TABLE]
for some weight function and all . Then is in the domain of convergence .
If in addition satisfies the finite-volume condition (3.3), then the density defined in (3.5) is equal to , moreover
[TABLE]
with absolutely convergent integrals and series.
The lemma follows from the tree-graph inequality due to [PY17] and additional combinatorial considerations, compare for example [JTTU14, Eq. (4.17)]. The details are similar to aspects of the proof of Lemma 3.7 and therefore omitted.
Definition 3.2**.**
For activities that satisfy (3.10) but not necessarily the condition (3.3), we adopt the equality as the definition of the density.
Remark 3.3* (Physical interpretation of ).*
Let be the total interaction of a particle at with the particles . By (3.5) and Lemma 3.1, we have
[TABLE]
where denotes the expectation with respect to the grand-canonical Gibbs measure. Thus is the excess free energy for a test particle pinned at the location .
Let be the set of bi-connected graphs, i.e., graphs that stay connected upon removal of a single vertex. Define
[TABLE]
We want to invert the map and express the inverse with bi-connected graphs. Before that we derive a convergent result for power series with coefficients given by bi-connected graphs.
Theorem 3.4**.**
Let . Suppose there exist functions with on such that
[TABLE]
for all . Then
[TABLE]
for all .
Define by
[TABLE]
Theorem 3.5**.**
There is a set such that is a bijection from onto , and for every , , we have if and only if
[TABLE]
where the latter converge in the sense that () holds.
If fulfills () for some and for the same functions and , then and hence .
If instead the following conditions including also a “finite volume condition” holds,
[TABLE]
then also
[TABLE]
The condition is a condition directly in terms of which is sufficient to guarantee that . Recall that was just defined indirectly as the image of .
Formula (3.14) does not make any sense in the “infinite volume case” even if we consider the translation invariant case as discussed below (3.3). In this case, though, the right hand side is proportional to the volume of , up to boundary errors. Hence, divided by the volume has a well defined limit.
For the definition of the free energy, we fix a reference measure on (for example, the Lebesgue measure on ). The (grand-canonical) free energy of a given density profile is defined via the Legendre transform of as
[TABLE]
with the Radon-Nikodým derivative of with respect to the reference measure . The supremum in (3.15) is over all non-negative measures that are absolutely continuous with respect to and such that the integral with the logarithm is absolutely convergent.
Theorem 3.6**.**
Assume that is absolutely continuous with respect to and satisfies
[TABLE]
then
[TABLE]
with absolutely convergent integrals and sum.
Let us first check that condition () is sufficient for the convergence of .
Lemma 3.7**.**
If satisfies condition () for some with , then satisfies and in particular condition (), where is defined as in (2.3) with given by (3.8).
Proof.
Set
[TABLE]
The first factor in (3.8) can be bounded as follows
[TABLE]
Indeed, this follows by induction in using
[TABLE]
and using that . Using this bound, we get
[TABLE]
In order to bound , we use a recent tree-graph inequality due to Procacci and Yuhjtman [PY17] in the form presented in [Uel17]. Then
[TABLE]
with the set of trees with vertex set . As a consequence, if a non-negative measure satisfies
[TABLE]
for all , then
[TABLE]
The inductive proof of (3.21) is similar to the proof of [PU09, Theorem 2.1] and therefore omitted. Condition () implies that satisfies
[TABLE]
Hence (3.20) and (3.21) hold true, and we can further bound (3.19) by
[TABLE]
which completes the proof. ∎
Next let us relate the coefficients of with bi-connected graphs.
Lemma 3.8**.**
The formal power series with coefficients (3.8) satisfies
[TABLE]
Proof.
The lemma follows from well-known identities for connected and bi-connected graphs, see for example [Ste64, Ler04, Far12, MH61], we sketch the argument for the reader’s convenience. If is a finite non-empty set, consider the following classes of graphs with vertex set :
- •
, the connected graphs on ;
- •
, the biconnected graphs on ;
- •
, the connected graphs that stay connected when removing [math] and the incident edges (equivalently, the connected graphs for which [math] is not an articulation point).
If is a graph with vertex set , define . Then
[TABLE]
In view of (A.7), setting , the coefficients of are given by
[TABLE]
By Eq. (A.8), the right-hand side of (3.22) is a power series with coefficients
[TABLE]
Eq. (3.24) allows us to rewrite as a sum over tuples consisting of an integer and graphs , where and form a partition of with allowed. Given such a tuple , a new graph is defined by gluing each to at the vertex (the vertex is identified with root [math] of ). Precisely, is an edge of if and only if:
- •
either and ,
- •
or for some we have and ,
- •
or for some we have and (or vice-versa) and .
In the new graph , each of the vertices is an articulation point (that is upon the removal of and the edges incident to the graph has a connect component which does not contain [math]. However, note that there can be other articulation points inside the ’s!), and the support of the graph consists of those vertices for which every path connecting to [math] has to pass through . The weight of the new graph is equal to the product of the weights of the ’s.
The rule defines a one-to-one correspondence between the tuples under consideration and graphs , and the weights are multiplicative. One deduces that is given by a sum over graphs and weights as in (3.23), therefore (3.22) holds true. ∎
As a consequence we can identify the coefficients of .
Lemma 3.9**.**
For given by (3.8), the family from Lemma 2.1 is given by
[TABLE]
Proof.
Lemma 3.8 yields
[TABLE]
Hence the right-hand side of (3.25) solves the fixed point equation () as considered in Lemma 2.12,furthermore the lemmas yields that, as the solution of the fixed point equation, the right hand side must be equal to the family from Lemma 2.1. ∎
Proof of Theorem 3.4.
If satisfies (), then by Lemma 3.7 it also satisfies (). However, by Theorem 2.3, it follows that () holds true as well, in particular is absolutely convergent and . Combining Eqs. (3.25) and () we get
[TABLE]
as formal power series, that means, that the coefficients of the series coincides. If we take the absolute value of the coefficients and we reconstruct the right hand side of the above equality one gets that
[TABLE]
where we define
[TABLE]
The right-hand side is bounded by because of () and (). ∎
Proof of Theorem 3.5.
Let as in (2.12). Set . By Lemma 3.7, we know that hence Theorem 2.5 guarantees . It follows from Theorem 2.5 that is a bijection from onto with inverse , hence if and only if . We insert the formula (3.25) from Lemma 3.9 for and obtain (3.12).
Let satisfy () and , then by Lemma 3.7. By Definition 3.2, the density is given by which is bounded by . As in the sense of formal power series and is a convergent on , it remains to show that the composition is also convergent. For that we do not only need that but also that all the interchanges are allowed, that is, an estimate in terms of and , namely (). Therefore, we finally get .
As an equality of formal power series, Eq. (3.14) follows from the dissymmetry theorem for connected and biconnected graphs and power series manipulations similar to the proof of Lemma 3.8. Precisely, we have the following identity
[TABLE]
The proof of (3.27) is easily adapted from [JTTU14, Theorem 3.1] or [Ler04] and therefore omitted. The first part of condition (3.13) is condition (3.10) from Lemma 3.1, we have established (3.14) in the sense of formal power series. Next, we check absolute convergence of the power series associated with the terms in Eq. (3.27).
Let us consider (3.27) term by term starting from the left. Consider
[TABLE]
The first part of condition (3.13) is the same as condition (3.20) with instead of , so we may apply the bound (3.21) and get that
[TABLE]
Hence the formal power series for is converging exactly in the sense that (3.28) is finite.
Next, by (3.29) and condition (3.13), we also have
[TABLE]
which is the sense in which the power series for converges.
Finally, define which by (3.29) is bounded by .
Now is in by the second condition in (3.13) and therefore is in as well. Thus we can bound
[TABLE]
where in the third but last inequality we applied () with instead of . At the very end we have used again condition (3.13). This is the sense in which the third term converges. Note that the sense of convergence is strong enough, such that that also re-ordering of the terms is converging so long one does not break up . As a consequence, Eq. (3.14) holds true not only as an equality of formal power series but also as an equality of convergent sums. ∎
Proof of Theorem 3.6.
The standard line of reasoning is as follows: we check that the solution to the equation —which exists by Theorem 3.5—is a maximizer in (3.15), deduce a formula for in terms of the maximizer , plug in (3.12) and (3.14), and obtain the statement. The full proof requires us to check that all steps are fully justified.
It is convenient to rewrite the definition (3.15) as
[TABLE]
where the supremum is taken over all measurable such that .
Let satisfy the assumptions of the theorem. By Theorem 3.5, the measure satisfies . Therefore, it is of the form with
[TABLE]
We check that is a maximizer in (3.32). As a preliminary observation, we note that using (), therefore condition (3.16) yields . Thus does indeed belong to the set over which the supremum in (3.32) is taken.
Let be another function with . We need to check that
[TABLE]
By the last condition in (3.16), the measure satisfies condition (3.3) and so and the right-hand side in (3.33) is finite. If , then the inequality (3.33) holds trivially true. If , then the inequality (3.33) is equivalent to
[TABLE]
and it will be checked with the help of convexity. Set
[TABLE]
It is a well-known consequence of Hölder’s inequality that is convex.
Next we check that the right derivative of at zero exists and is given by . We look at the derivative of first. Set . We have
[TABLE]
To facilitate differentiation, we check that configurations with infinite ’s do not contribute. As we have -a.e. that . Furthermore, we can see that
[TABLE]
where we used in the first equality that and in the inequality that . By choice of , the integral is finite, hence , -almost everywhere. It follows that the last expression in (3.36) vanishes, hence also all preceding expressions in the chain of inequalities vanish. Therefore we have that -a.e. holds . The same holds for . As only if either or are infinite we have that
[TABLE]
has full -measure and hence is well-defined on . The considerations above yield
[TABLE]
for all . We also have as that
[TABLE]
and therefore it holds that
[TABLE]
Each integrand goes to zero as , we need a -independent integrable upper bound in order to apply dominated convergence. For and we have
[TABLE]
If , pick and assume . We apply the inequality to and find that the upper bound is . Altogether we find
[TABLE]
This inequality applied to , and yields, for , that the integrand in (3.39) is bounded in absolute value by
[TABLE]
When one integrates over , multiply with , sum over , one obtains
[TABLE]
Thus we may apply dominated convergence to (3.39) and find that indeed
[TABLE]
from which we deduce . We have already observed that is convex and hence one has that , which for is precisely the inequality (3.34). It follows that is a maximizer in (3.33) and
[TABLE]
The final step is to insert the expression for from Eq. (3.14) in Theorem 3.5, keeping in mind that . This then yields (3.17).
To justify the application of (3.14), we could in principle impose conditions on that guarantee that satisfies the condition (3.13) from Theorem 3.5, however this would result in more restrictive conditions and therefore we take a slightly different approach. We start from the formal power series identity
[TABLE]
which follows from (3.14) and . It is justified, as a formal power series identity, without any conditions on . Additional arguments are needed to ensure that (3.41) holds true as an equality of convergent expressions. The exponential of the left-hand side of (3.41) is the formal power series
[TABLE]
see Eq. (A.8) in Appendix (A). The set is non-empty but is allowed (we agree ). We have
[TABLE]
with
[TABLE]
The term in parentheses is smaller or equal to by our assumption and Lemma 3.7, therefore
[TABLE]
by the last assumption on in (3.16). It follows that satisfies the finite-volume condition (3.3), hence is finite and thus (3.43) is finite. It follows that (3.42) is equal to not just as a formal power series but as an equality of convergent series.
Similar considerations apply to the right-hand side of (3.41). It follows that (3.41) holds true as an equality of convergent series. We plug the expression for from (3.41) into the formula (3.40) and obtain the expression (3.17) for the free energy. ∎
4. Examples
4.1. Homogeneous gas
Consider a homogeneous gas of particles in a domain , interacting via a translationally invariant pair potential , with . The potential is assumed to be stable,
[TABLE]
for some , all , and all . Furthermore, we assume
[TABLE]
Further assume that for some . Mayer’s irreducible cluster integrals are defined as
[TABLE]
which in terms of the coefficients from (3.11), can be expressed as
[TABLE]
The grand-canonical partition function at inverse temperature and activity is defined in the usual way, and the pressure is given by
[TABLE]
with the limit taken along van Hove sequences [Rue69]. Further set
[TABLE]
It is well-known [Rue69] that if , then the limit (4.2) and the derivative (4.3) exist, moreover they define functions that are analytic in (at least), we use the same letters for the analytic extensions to the complex disk. We fix and drop the -dependence from the notation in and .
Theorem 4.1**.**
- (a)
If satisfies , then . In particular, the radius of convergence of is bounded from below by
[TABLE] 2. (b)
There exists some neighborhood of the origin with
[TABLE]
such that is a bijection from onto the open ball , with inverse
[TABLE] 3. (c)
For all , we have
[TABLE] 4. (d)
For all , the Helmholtz free energy is given by
[TABLE]
Let us compare our result to what was known before. The bound (4.4) should be contrasted with the best known bound
[TABLE]
where
[TABLE]
(the lower bound is sharp, it is actually , cf. [Tat13]) and
[TABLE]
For non-negative pair potentials, we have and (4.7) coincides with the lower bound proven by Lebowitz and Penrose [LP64], who also proved the lower bound in (4.8). For attractive pair potentials, the bound (4.7) is an improvement on the bound from [LP64], which was proven in [Pro17], where the constant is called the Basuev stability constant. The constant also enters an asymptotic upper bound to as , see [Jan12, Theorem 2.8].
Let us compare our bound (4.4) with (4.7). It differs in two ways: it has a different constant and a different exponential . Our constant is better but for attractive interactions our exponential in general is worse. As a consequence, for non-negative interactions, our bound yields a considerable improvement over the bound from [LP64] and hence (4.7)
[TABLE]
therefore our bound improves substantially all known bound. The improvement subsists for attractive interactions with small . For large or strong interactions, the bound (4.7) due to [Pro17] trumps ours.
Remark 4.2* (Attractive potentials).*
Additional work is needed to see whether our exponent in (4.4) can be replaced by the exponent as in (4.7). This is related to the fact that bounding ’s in the Mayer expansion may sometimes be better than bounding in the representation . Indeed, in our approach, the factor comes up in Lemma 3.7 where, in order to write the density as an exponential and bound the exponent, we split the expansion of and we get an additional factor in Eq. (3.18).
Remark 4.3* (Relation with Lagrange inversion).*
After the proof of Theorem 4.1 we will explain how to recover our bound (4.4) in the case based on a slightly different treatment of the Lagrange inversion from [LP64], and where exactly our gain is achieved.
Remark 4.4* (Further improvements for non-negative pair potentials).*
The factor could be further improved using our techniques combining them with the refined tree-graph inequality from [FPS07], i.e., working with trees where children communicate, resulting in additional constraints on trees. Instead of the generating function of the trees, one has to consider the solution of the equation
[TABLE]
where are defined as in [FPS07], where it was used for the activity expansion. Doing so, for hard disks, that is , one obtains as a lower bound for the radius of convergence of the virial expansion instead of .
Proof of Theorem 4.1.
We apply the considerations from Section 3 to the case , the Borel sets, and specialize to translationally invariant measures with a constant scalar . For such a measure the measure given by is translationally invariant as well, we write and note that is equal to the limit (4.3), moreover with
[TABLE]
Conversely, if is a translationally invariant measure, then the inverse from is translationally invariant as well.
By Theorem 3.5 applied to , constant functions and , if the number satisfies
[TABLE]
for some with , then
[TABLE]
(remember (4.1) and ()). Condition (4.10) is further evaluated as
[TABLE]
Therefore if , then condition (4.10) holds true with and Eq. (4.11) holds true with . Part (a) of the theorem follows.
Part (b) follows from the first part of Theorem 3.5, with and . Then by Theorem 2.3. For there exists with such that if and only if . Note that means that () holds for an instead of and not necessarily .
For part (c), we note that the validity of (4.5) for sufficiently small is already known [LP64]. Alternatively, we may deduce from Theorem 3.5 by working first in finite volume and then taking the infinite-volume limit. This way of proceding guarantees the validity of (4.5) under the additional condition for some . The additional condition is eliminated by invoking analyticity: The left and right sides of (4.5) define functions of that are analytic in and coincide on some non-empty open ball, therefore they are equal on all of .
Theorem 3.6, using that in part (a) we have shown whenever , gives part (d) of the theorem in finite volume. By part (a) the radius of convergence of the series in (4.6) is independent of the volume. Combining with the translation invariance we see that the right-hand side of (3.17), divided by the volume, converges to the right-hand side of (4.6) in the infinite-volume limit. Furthermore, that the free-energy is the Legendre transform of the pressure can be extended to the infinite volume as well. We note that the validity of (4.5) for sufficiently small was already known [LP64]. ∎
Let us provide an alternative derivation of the bound (4.4) for non-negative potentials (). The key point in [LP64] is a lower bound for the radius of convergence of the expansion in as
[TABLE]
which is derived in [LP64] using a Lagrange inversion /tk, where is the convergence radius of at zero. A lower bound for is then deduced from a lower bound for . This is done in [LP64] (and also in [Tat13]) with the help of the triangle inequality . It turns out that if, instead, one uses the exponential structure and an upper bound for one can recover our bound (4.4) from (4.12). Let us explain the strategy. Our aim is to prove the following chain of inequalities
[TABLE]
where is the generating function of labelled rooted trees (equivalently, with the Lambert function) and is the radius of convergence of the analytic function .
The first inequality in Eq. (4.13), merely uses the idea . The second inequality can be derived in several ways. It follows directly from Penrose’s tree-graph inequality, namely as in Lemma 3.7 use estimate (3.18) with , then by the tree-graph inequality . Hence one gets . Alternatively, following more the type of results used in this article, one gets from the inductive proof of Theorem 2.1 in [PU09], cf. Lemma 3.7 as well, that whenever . It remains to optimize over .Recall that is converges on with and is also the branch of the real solution of the relation with . Hence, satisfies, for ,
[TABLE]
Since diverges for , Eq. (4.14) stays true for if we interpret the infimum of the empty set as infinity. Equation (4.14) follows from the relation solved by , the bound and the the fact that is strictly increasing on . Consequently, using (4.14) we get
[TABLE]
Hence the bound is finite and thus the radius of convergence of is .
Finally, the third inequality can be derived using again (4.14) and , we have
[TABLE]
Setting we deduce the final bound in (4.13), which is the same as (4.4) in the case of non-negative potential.
4.2. Inhomogeneous gas
Here we start from a homogeneous gas with fixed reference activity and then add an external potential . The grand-canonical partition function in some bounded domain becomes
[TABLE]
and the density is given by
[TABLE]
Eq. (4.18) can be brought into the form from Section 3: let
[TABLE]
then
[TABLE]
similarly for the partition function. It follows from the tree-graph inequality in [PY17] that if
[TABLE]
for some and all , then the limit
[TABLE]
exists and is given by the usual combinatorial formulas, with position-dependent activity given in (4.19).
It is a classical problem to ask whether, given a density profile , there exists a background potential such that the density profile in the associated grand-canonical ensemble is equal to the given profile . In view of (4.19), Theorem 3.5 has direct implications for this problem when activities converge. For results without cluster expansions, see [CCL84].
Theorem 4.5**.**
Fix and a pair potential with stability constant and lower bound . Let be a measurable function such that
[TABLE]
for all and some functions with pointwise. Then there exists a unique (up to null sets) background potential that satisfies (4.21) and such that for Lebesgue-almost all . It is given by
[TABLE]
with absolutely convergent integrals and sum.
A sufficient condition for (4.22) to hold true is that (pick ). In fact one easily checks that, if we are interested in bounded density profiles only, we are in the situation where a direct application of the Banach inversion theorem (Theorem 2.10) is possible.
Proof.
The absolute convergence of the series in (4.23) follows right away from Theorem 3.4 applied to . By Theorem 3.5, there is a unique measure in the domain of convergence such that , with the density at activity for the interaction potential . Moreover the activity is given by Eq. (3.12), which after plugging in becomes with
[TABLE]
We adopt (4.19) as a definition of the external potential, then and is given by (4.23). It satisfies by the definition (4.24) of and . Condition (4.21) follows rom () as then and thus (4.22) implies (4.21). ∎
4.3. Mixture of hard spheres
Consider a mixture of hard spheres with radii , for example, . The activity of the sphere depends on the type but otherwise the system is homogeneous. To bring the model into the form from Section 3, let , with representing a sphere of radius centered at . We consider measures informally given by . More precisely, for every non-negative test function . The interaction is hard core exclusion
[TABLE]
Let be the infinite-volume pressure and . A sufficient condition for the convergence of the activity expansion of the pressure is
[TABLE]
for some non-negative sequence of positive numbers and all , as is easily checked from [Uel04].
Theorem 4.6**.**
Suppose that satisfies
[TABLE]
for all and two sequences , with for all . Then there exists a unique sequence with for all and such that condition (4.25) holds. It is given by
[TABLE]
The coefficients are given by sums over -connected graphs as in (3.11). The sum in the exponential in (4.27), with absolute values inside the integral, is bounded by .
The theorem is deduced from Theorems 3.4 and 3.5, the details are left to the reader.
4.4. Flexible molecules. Liquid crystals
Finally we come to a system of objects with internal degrees of freedom: we assume that the space is of the form with a bounded domain.333We could also allow for spaces representing a multi-species system where each species has its own spin space , but for simplicity we stick to the single-species case. The space represents internal degrees of freedom (spin, orientation, shape of a molecule…). For example, we could take as the projective space (i.e., with identification of parallel vectors) and think of as a thin rod centered at with orientiation . Such a model is often used for the study of liquid crystals [Ons49].
Suppose we are given a reference measure on that is of the form , i.e., it is the product of the Lebesgue measure on and a reference measure on (e.g. a uniform measure on orientations of thin rods). To simplify formulas, we write instead of . The pair potential is a function of both position and internal degree of freedom.
Following Onsager, one could work in a multi-species canonical ensemble, where each species represents a discretized orientation. In such a setup, deriving the canonical free energy is immediate following [PT12]. One can easily derive a functional for continuous orientations, using our techniques presented here, that is, to start in the grand-canonical ensemble, and obtain the grand-canonical free energy via Legendre transform and inversion of the density-activity relation, which is precisely the definition (3.15) for . Let us write and, by a slight abuse of language, instead of .
For simplicity we prove results for non-negative pair potentials only but note that our general theorems lead just as easily to stable pair potential.
Theorem 4.7**.**
Let and . Suppose there exist weight functions with . Suppose that satisfies
[TABLE]
for all , and
[TABLE]
Then
[TABLE]
with absolutely convergent integrals and sum.
Proof.
The theorem is an immediate consequence of Theorem 3.6. ∎
When we think of rods with an orientiation, we may specialize to situations where there is translational invariance but not necessarily rotational invariance:
Corollary 4.8**.**
Assume that for some scalar and non-negative with . Assume that , , and
[TABLE]
Then
[TABLE]
with absolutely convergent integral and series.
If is translation invariant, then the right-hand side of (4.28) is proportional to the volume, up to boundary errors that become irrelevant in the thermodynamic limit, and the corollary also yields an expression for the thermodynamic limit .
The right-hand side of (4.28) corresponds to the functional from Eq. (27) in [Ons49], which is the free energy functional derived by Onsager before applying additional approximations due to thinness of rods etc.
Remark 4.9*.*
In [JTTU14], in order to obtain -connected coefficients for the case of molecules with internal degrees of freedom, we needed to assume rigidity of the molecules so that Lemma 4.1 in [JTTU14] about factorization of graph weights holds true. In the present article, as seen in Corollary 4.8, we obtain the -connected coefficients as well provided we keep the probability density of shapes as an explicit variable. If instead we look at
[TABLE]
expand the minimizer in powers of and compose with the expansion of , we see that the coefficient of in the expansion of is not given by .
Appendix A Formal power series and Ruelle’s algebraic formalism
Here we summarize some facts on the formal power series used in this article, and point out the relation with Ruelle’s algebraic formalism. We are interested in power series and formal power series of the form
[TABLE]
where is a measurable space is a measure on , and is a scalar, and are measurable maps that are invariant under permutation of the arguments.
In general, for a formal power series, the integrals and the series need not to converge, hence, in analogy with the theory of formal power series of a single variable, we define a formal power series as a sequence of symmetric functions and downgrade (A.1) to a mnemonic notation. Standard operations such as sums and products are defined directly as operations on the sequences in such a way that for two sufficiently well convergent power series one obtains the same result. The sum of two formal power series is the formal series with coefficients , for the formal series is the series with coefficients . Other operations are defined below. The resulting algebra of formal power series is exactly the algebra of symmetric functions introduced by Ruelle [Rue69, Chapter 4.4].
Product. Let be formal power series, then is defined by
[TABLE]
The empty set is explicitly allowed. As an operation on sequences of symmetric functions, this is exactly the convolution in [Rue69, Chapter 4.4]. It is not difficult to check that the product is commutative and associative. Eq. (A.2) generalizes to products as
[TABLE]
where the sum runs over ordered partitions of into disjoint parts, with explicitly allowed.
The definition (A.2) is motivated by the following computation, which is valid if the power series are absolutely convergent: From
[TABLE]
we get
[TABLE]
The summand for should be read as . The binomial coefficient is equal to the number of subsets of cardinality . The value of the integral
[TABLE]
depends on the cardinality of alone, and so we find that
[TABLE]
with defined in (A.2).
Variational derivative. For and a formal power series over , we define
[TABLE]
In the language of [Rue69, Chapter 4.4], corresponds to the derivation . Formally,
[TABLE]
and
[TABLE]
as it should be.
Composition I and exponential series. Let be a formal power series in a single variable and a formal power series on with . The formal power series on is defined by and for ,
[TABLE]
with the collection of set partitions of . Note that only because the expression (A.6) is well-defined, because only in this case the sum is finite. Formally,
[TABLE]
In the second line we have used (A.3). Because of , the only relevant contributions in the last line are from non-empty’s. The factor can be removed if we decide to sum over non-ordered partitions instead of ordered partitions , and we arrive at the expression (A.6) for the coefficients of .
An important special case is , for which Eq. (A.6) becomes
[TABLE]
which is exactly the exponential on the algebra of symmetric functions from [Rue69, Chapter 4.4].
Composition II. In the proof of Lemma 2.1 we need a more general type of composition, namely let be a formal power series on with and a family of power series
[TABLE]
If is absolutely convergent for each , define
[TABLE]
If sums and integrals are absolutely convergent, then
[TABLE]
where the can be empty. We group pairs with a common sum . For the factorials we note
[TABLE]
Exploiting the symmetry of the functions and , we find that the coefficients of are given by
[TABLE]
Appendix B Holomorphic functions on Banach spaces
Here we collect some fact that are useful for the Banach inversion. We refer the reader to [Har03, Muj06] for accessible surveys and [Din99, Muj86] for details. Let and be two complex Banach spaces. A multilinear map is bounded if
[TABLE]
Definition B.1** (Homogeneous polynomials and power series).**
- (1)
A mapping is a continuous -homogeneous polynomial if there exists a bounded multilinear map such that . 2. (2)
A power series from into is a series of the form , with and a continuous -homogeneous polynomial. The radius of uniform convergence of the series is the supremum over all such that the series converges uniformly on .
The norm of a continuous -homogeneous polynomial is
[TABLE]
For example, if and , then .
Proposition B.2** (Cauchy-Hadamard formula).**
[Muj06, Prop. 6]** The radius of uniform convergence of the power series satisfies
[TABLE]
Theorem B.3**.**
[Muj06, Theorem 7]** Let be a non-empty open subset and . The following conditions are equivalent:
- (1)
For each , the Fréchet derivative of at exists: i.e., there exists a bounded linear map such that
[TABLE] 2. (2)
For each , there exists a power series that converges to uniformly on some ball (with ). 3. (3)
* is continuous in and, for each , all elements of the dual Banach space , and all , the map is holomorphic in the usual sense in the open set .*
Definition B.4**.**
A mapping is called holomorphic if it satisfies one (hence, all three) of the conditions (1)-(3) in Theorem B.3.
Many theorems for holomorphic functions in have analogues (for example, Cauchy integral formulas), but there are a few pitfalls. For example, it is not true that the Taylor series of a function holomorphic on all of has infinite uniform radius of convergence. Also, it is not true that a holomorphic function is bounded on balls that are bounded away from .
Example B.5*.*
[Har03, Example 2.6] Let be the Banach space of complex-valued sequences that converge to zero, equipped with the usual supremum norm. Define by
[TABLE]
Then is holomorphic on all of , but the radius of uniform convergence (in the sense of Definition B.1) of the series is , and for every , the function is unbounded on the ball .
We conclude with a quantitative inverse function theorem. Let and open subset of and . Call . An inverse function theorem give condition under which there exist open neighborhoods of [math] and of , respectively, such that is bijection with holomorphic inverse. An quantitative inverse function theorem additionally singles out numbers and , which only depends on , for which we may choose and . Alternatively, one can have quantitative inversion theorem such that and . Such numbers and are sometimes called Bloch radii after Bloch’s theorem. In the following theorem .
Theorem B.6**.**
[Har77, Proposition 2]** Let and be open balls in some complex Banach space and a holomorphic function. Suppose that the derivative at the origin is invertible with bounded inverse . Let
[TABLE]
Then maps biholomorphically onto a domain covering .
Acknowledgments
The main part of this article was completed when the first and third authors were members of the Department of Mathematics at the University of Sussex and the second was frequently visiting; the authors acknowledge the department for the nice atmosphere. S. J. thanks the GSSI and T.K. the university in L’Aquila, Italy, for hospitality and M. Lewin for pointing out possible connections with the setting of the Nash-Moser theorem.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[BLL 98] F. Bergeron, G. Labelle, and P. Leroux, Combinatorial species and tree-like structures , Encyclopedia of mathematics and its applications, vol. 67, Cambridge University Press, 1998.
- 2[CCL 84] J. T. Chayes, L. Chayes, and E. H. Lieb, The inverse problem in classical statistical mechanics , Comm. Math. Phys. 93 (1984), no. 1, 57–121.
- 3[Din 99] S. Dineen, Complex analysis on infinite dimensional spaces , Springer Monographs in Mathematics, Springer, 1999.
- 4[Far 12] W. G. Faris, Biconnected graphs and the multivariate virial expansion , Markov Processes and Related Fields 18 (2012), no. 3, 357–386.
- 5[FPS 07] R. Fernández, A. Procacci, and B. Scoppola, The analyticity region of the hard sphere gas. Improved bounds , J. Stat. Phys. 128 (2007), no. 5, 1139–1143.
- 6[Ges 87] I. M. Gessel, A combinatorial proof of the multivariable Lagrange inversion formula , J. Combin. Theory Ser. A 45 (1987), no. 2, 178–195.
- 7[Ham 82] R. S. Hamilton, The inverse function theorem of Nash and Moser , Bull. Amer. Math. Soc. 7 (1982), no. 1, 65–122.
- 8[Har 77] L. A. Harris, On the size of balls covered by analytic transformations , Monatshefte für Mathematik 83 (1977), no. 1, 9–23.
