Virial inversion and density functionals

Sabine Jansen; Tobias Kuna; Dimitrios Tsagkarogiannis

arXiv:1906.02322·math-ph·September 11, 2019

Virial inversion and density functionals

Sabine Jansen, Tobias Kuna, Dimitrios Tsagkarogiannis

PDF

TL;DR

This paper introduces a new mathematical inversion theorem for functionals in infinite-dimensional spaces, with applications to density function theory and improved convergence estimates for the virial expansion in statistical mechanics.

Contribution

It develops a novel inversion method using fixed point equations and combinatorial identities, enhancing convergence analysis for density functionals and the virial expansion.

Findings

01

Proves a new inversion theorem for power series functionals.

02

Provides rigorous convergence framework for inhomogeneous systems.

03

Achieves improved radius of convergence for the virial expansion of the hard sphere gas.

Abstract

We prove a novel inversion theorem for functionals given as power series in infinite-dimensional spaces and apply it to the inversion of the density-activity relation for inhomogeneous systems. This provides a rigorous framework to prove convergence for density functionals for inhomogeneous systems with applications in classical density function theory, liquid crystals, molecules with various shapes or other internal degrees of freedom. The key technical tool is the representation of the inverse via a fixed point equation and a combinatorial identity for trees, which allows us to obtain convergence estimates in situations where Banach inversion fails. Moreover, the new method for the inversion gives for the (homogeneous) hard sphere gas a significantly improved radius of convergence for the virial expansion improving the first and up to now best result by Lebowitz and Penrose (1964).

Equations425

A_{n} (q; x_{σ (1)}, \dots, x_{σ (n)}) = A_{n} (q; x_{1}, \dots, x_{n}),

A_{n} (q; x_{σ (1)}, \dots, x_{σ (n)}) = A_{n} (q; x_{1}, \dots, x_{n}),

\sum_{n=1}^{\infty}\frac{1}{n!}\int_{\mathbb{X}^{n}}\bigl{|}A_{n}(q;x_{1},\ldots,x_{n})\bigr{|}\,|z|(\mathrm{d}x_{1})\cdots|z|(\mathrm{d}x_{n})<\infty

\sum_{n=1}^{\infty}\frac{1}{n!}\int_{\mathbb{X}^{n}}\bigl{|}A_{n}(q;x_{1},\ldots,x_{n})\bigr{|}\,|z|(\mathrm{d}x_{1})\cdots|z|(\mathrm{d}x_{n})<\infty

A (q; z) := n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} A_{n} (q; x_{1}, \dots, x_{n}) z (d x_{1}) \dots z (d x_{n}) (z \in D (A)) .

A (q; z) := n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} A_{n} (q; x_{1}, \dots, x_{n}) z (d x_{1}) \dots z (d x_{n}) (z \in D (A)) .

M_{C} \supset D (A) \to M_{C}, z \mapsto ρ [z]

M_{C} \supset D (A) \to M_{C}, z \mapsto ρ [z]

ρ [z] (d q) \equiv ρ (d q; z) := e^{- A (q; z)} z (d q),

ρ [z] (d q) \equiv ρ (d q; z) := e^{- A (q; z)} z (d q),

ν = ρ [z] \Leftrightarrow z = ζ [ν] .

ν = ρ [z] \Leftrightarrow z = ζ [ν] .

ζ [ν] (d q) \equiv ζ (d q; ν) = e^{A (q; ζ [ν])} ν (d q) .

ζ [ν] (d q) \equiv ζ (d q; ν) = e^{A (q; ζ [ν])} ν (d q) .

T_{q}^{\circ} (ν) \equiv T^{\circ} (q; ν) = e^{A (q; ζ [ν])}

T_{q}^{\circ} (ν) \equiv T^{\circ} (q; ν) = e^{A (q; ζ [ν])}

ζ [ν] (d q) = T_{q}^{\circ} (ν) ν (d q) = e^{A (q; ν T_{q}^{\circ} (ν))} ν (d q)

ζ [ν] (d q) = T_{q}^{\circ} (ν) ν (d q) = e^{A (q; ν T_{q}^{\circ} (ν))} ν (d q)

T_{q}^{\circ}(\nu)=\exp\Biggl{(}\sum_{n=1}^{\infty}\frac{1}{n!}\int_{\mathbb{X}^{n}}A_{n}(q;x_{1},\ldots,x_{n})T_{x_{1}}^{\circ}(\nu)\cdots T_{x_{n}}^{\circ}(\nu)\nu(\mathrm{d}x_{1})\cdots\nu(\mathrm{d}x_{n})\Biggr{)}.

T_{q}^{\circ}(\nu)=\exp\Biggl{(}\sum_{n=1}^{\infty}\frac{1}{n!}\int_{\mathbb{X}^{n}}A_{n}(q;x_{1},\ldots,x_{n})T_{x_{1}}^{\circ}(\nu)\cdots T_{x_{n}}^{\circ}(\nu)\nu(\mathrm{d}x_{1})\cdots\nu(\mathrm{d}x_{n})\Biggr{)}.

T_{q}^{\circ} (ν) = 1 + n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} t_{n} (q; x_{1}, \dots, x_{n}) ν (d x_{1}) \dots ν (d x_{n}) (q \in X)

T_{q}^{\circ} (ν) = 1 + n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} t_{n} (q; x_{1}, \dots, x_{n}) ν (d x_{1}) \dots ν (d x_{n}) (q \in X)

n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} B_{n} (q; x_{1}, \dots, x_{n}) ν (d x_{1}) \dots ν (d x_{n}) = n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} A_{n} (q; x_{1}, \dots, x_{n}) T_{x_{1}}^{\circ} (ν) \dots T_{x_{n}}^{\circ} (ν) ν (d x_{1}) \dots ν (d x_{n})

n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} B_{n} (q; x_{1}, \dots, x_{n}) ν (d x_{1}) \dots ν (d x_{n}) = n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} A_{n} (q; x_{1}, \dots, x_{n}) T_{x_{1}}^{\circ} (ν) \dots T_{x_{n}}^{\circ} (ν) ν (d x_{1}) \dots ν (d x_{n})

B_{n}(q;x_{1},\ldots,x_{n})=\sum_{m=1}^{n}\sum_{\begin{subarray}{c}J\subset[n]\\ \#J=m\end{subarray}}A_{m}\bigl{(}q;(x_{j})_{j\in J}\bigr{)}\sum_{\begin{subarray}{c}(V_{j})_{j\in J}:\\ \dot{\cup}_{j\in J}V_{j}=[n]\setminus J\end{subarray}}\prod_{j\in J}t_{\#V_{j}}\bigl{(}x_{j};(x_{v})_{v\in V_{j}}\bigr{)},

B_{n}(q;x_{1},\ldots,x_{n})=\sum_{m=1}^{n}\sum_{\begin{subarray}{c}J\subset[n]\\ \#J=m\end{subarray}}A_{m}\bigl{(}q;(x_{j})_{j\in J}\bigr{)}\sum_{\begin{subarray}{c}(V_{j})_{j\in J}:\\ \dot{\cup}_{j\in J}V_{j}=[n]\setminus J\end{subarray}}\prod_{j\in J}t_{\#V_{j}}\bigl{(}x_{j};(x_{v})_{v\in V_{j}}\bigr{)},

B_{1} (q; x_{1})

B_{1} (q; x_{1})

B_{2} (q; x_{1}, x_{2})

t_{n}(q;x_{1},\ldots,x_{n})=\sum_{m=1}^{n}\sum_{\{J_{1},\ldots,J_{m}\}\in\mathcal{P}_{n}}\prod_{\ell=1}^{m}B_{\#J_{\ell}}\bigl{(}q;(x_{j})_{j\in J_{\ell}}\bigr{)},

t_{n}(q;x_{1},\ldots,x_{n})=\sum_{m=1}^{n}\sum_{\{J_{1},\ldots,J_{m}\}\in\mathcal{P}_{n}}\prod_{\ell=1}^{m}B_{\#J_{\ell}}\bigl{(}q;(x_{j})_{j\in J_{\ell}}\bigr{)},

t_{1} (q; x_{1})

t_{1} (q; x_{1})

t_{2} (q; x_{1}, x_{2})

n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} ∣ A_{n} (q; x_{1}, \dots, x_{n}) ∣ e^{\sum_{j = 1}^{n} b (x_{j})} ∣ ν ∣ (d x_{1}) \dots ∣ ν ∣ (d x_{n}) \leq b (q) .

n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} ∣ A_{n} (q; x_{1}, \dots, x_{n}) ∣ e^{\sum_{j = 1}^{n} b (x_{j})} ∣ ν ∣ (d x_{1}) \dots ∣ ν ∣ (d x_{n}) \leq b (q) .

1 + n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} ∣ t_{n} (q; x_{1}, \dots, x_{n}) ∣ ∣ ν ∣ (d x_{1}) \dots ∣ ν ∣ (d x_{n}) \leq e^{b (q)}

1 + n = 1 \sum \infty \frac{1}{n !} \int_{X^{n}} ∣ t_{n} (q; x_{1}, \dots, x_{n}) ∣ ∣ ν ∣ (d x_{1}) \dots ∣ ν ∣ (d x_{n}) \leq e^{b (q)}

S_{q}^{N} (ν) := 1 + n = 1 \sum N \frac{1}{n !} \int_{X^{n}} ∣ t_{n} (q; x_{1}, \dots, x_{n}) ∣ ∣ ν ∣ (d x_{1}) \dots ∣ ν ∣ (d x_{n}) .

S_{q}^{N} (ν) := 1 + n = 1 \sum N \frac{1}{n !} \int_{X^{n}} ∣ t_{n} (q; x_{1}, \dots, x_{n}) ∣ ∣ ν ∣ (d x_{1}) \dots ∣ ν ∣ (d x_{n}) .

S_{q}^{N} (ν)

S_{q}^{N} (ν)

\displaystyle\leq\exp\Biggl{(}\sum_{n=1}^{N-1}\frac{1}{n!}\int_{\mathbb{X}^{n}}\bigl{|}A_{n}(q;x_{1},\ldots,x_{n})\bigr{|}\mathrm{e}^{b(x_{1})+\cdots+b(x_{n})}\ |\nu|(\mathrm{d}x_{1})\cdots|\nu|(\mathrm{d}x_{n})\Biggr{)}

\leq e^{b (q)} .

b (q) := lo g T_{q}^{\circ} (ν) .

b (q) := lo g T_{q}^{\circ} (ν) .

V_{b} := {ν \in M_{C} ∣ ν satisfies condition \eqref suff1} .

V_{b} := {ν \in M_{C} ∣ ν satisfies condition \eqref suff1} .

ζ [ν] (d q) = ζ (d q; ν) := T_{q}^{\circ} (ν) ν (d q) .

ζ [ν] (d q) = ζ (d q; ν) := T_{q}^{\circ} (ν) ν (d q) .

ρ (d q; z)

ρ (d q; z)

= e^{- A (q; ζ [ν])} T_{q}^{\circ} (ν) ν (d q) = ν (d q) .

w\bigl{(}T,(P_{i})_{0\leq i\leq n};x_{0},x_{1},\ldots,x_{n}\bigr{)}:=\prod_{i=0}^{n}\prod_{J\in P_{i}}A_{\#J+1}\bigl{(}x_{i};(x_{j})_{j\in J}\bigr{)}

w\bigl{(}T,(P_{i})_{0\leq i\leq n};x_{0},x_{1},\ldots,x_{n}\bigr{)}:=\prod_{i=0}^{n}\prod_{J\in P_{i}}A_{\#J+1}\bigl{(}x_{i};(x_{j})_{j\in J}\bigr{)}

T_{q}^{\circ}(z)=1+\sum_{n=1}^{\infty}\frac{1}{n!}\int_{\mathbb{X}^{n}}\sum_{(T,(P_{i})_{i=0,\ldots,n})\in\mathcal{TP}_{n}^{\circ}}w\bigl{(}T,(P_{i})_{i=0,\ldots,n};q,x_{1},\ldots,x_{n}\bigr{)}z^{n}(\mathrm{d}\boldsymbol{x}).

T_{q}^{\circ}(z)=1+\sum_{n=1}^{\infty}\frac{1}{n!}\int_{\mathbb{X}^{n}}\sum_{(T,(P_{i})_{i=0,\ldots,n})\in\mathcal{TP}_{n}^{\circ}}w\bigl{(}T,(P_{i})_{i=0,\ldots,n};q,x_{1},\ldots,x_{n}\bigr{)}z^{n}(\mathrm{d}\boldsymbol{x}).

\tilde{t}_{n}(q;x_{1},\ldots,x_{n}):=\sum_{(T,(\mathcal{P}_{i})_{i=0,\ldots,n})\in\mathcal{TP}_{n}^{\circ}}w\bigl{(}T,(P_{i})_{0\leq i\leq n};q,x_{1},\ldots,x_{n}\bigr{)}.

\tilde{t}_{n}(q;x_{1},\ldots,x_{n}):=\sum_{(T,(\mathcal{P}_{i})_{i=0,\ldots,n})\in\mathcal{TP}_{n}^{\circ}}w\bigl{(}T,(P_{i})_{0\leq i\leq n};q,x_{1},\ldots,x_{n}\bigr{)}.

\tilde{t}_{n}(q;x_{1},\ldots,x_{n})=\sum_{m=1}^{n}\sum_{\{J_{1},\ldots,J_{m}\}\in\mathcal{P}_{n}}\prod_{\ell=1}^{m}\tilde{B}_{\#J_{\ell}}\bigl{(}q;(x_{j})_{j\in J_{\ell}}\bigr{)}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Virial inversion and density functionals

Sabine Jansen

Mathematisches Institut, Ludwig-Maximilians-Universität, 80333 München, Germany

[email protected]

,

Tobias Kuna

Department of Mathematics and Statistics, University of Reading, Reading RG6 6AX, UK

[email protected]

and

Dimitrios Tsagkarogiannis

Dipartimento di Ingegneria e Scienze dell’Informazione e Matematica, Università degli Studi dell’Aquila, 67100 L’Aquila, Italy

[email protected]

(Date: 30 August 2019)

Abstract.

We prove a novel inversion theorem for functionals given as power series in infinite-dimensional spaces and apply it to the inversion of the density-activity relation for inhomogeneous systems. This provides a rigorous framework to prove convergence for density functionals for inhomogeneous systems with applications in classical density function theory, liquid crystals, molecules with various shapes or other internal degrees of freedom. The key technical tool is the representation of the inverse via a fixed point equation and a combinatorial identity for trees, which allows us to obtain convergence estimates in situations where Banach inversion fails. Moreover, the new method for the inversion gives for the (homogeneous) hard sphere gas a significantly improved radius of convergence for the virial expansion improving the first and up to now best result by Lebowitz and Penrose (1964).

Keywords: cluster and virial expansions – density functional theory – holomorphic functions in Banach spaces

MSC 2010 classification: 82B05, 82D15, 82D30, 47J07, 05C05

1 Introduction
2 General inversion theorems
2.1 Main inversion theorem with proof
2.2 Scale of Banach spaces. Banach inversion
2.3 An equivalent fixed point equation
3 Virial expansion. Density functional
4 Examples
4.1 Homogeneous gas
4.2 Inhomogeneous gas
4.3 Mixture of hard spheres
4.4 Flexible molecules. Liquid crystals
A Formal power series and Ruelle’s algebraic formalism
B Holomorphic functions on Banach spaces

1. Introduction

Deriving functional expressions for thermodynamic quantities from microscopic models which are based on physical principles is one of the main challenges of both theoretical and computational methods in statistical mechanics. Furthermore, the use of such functionals is ubiquitous in applied mathematics for example in classical density function theory, liquid crystals, heterogenous materials, colloid systems, system of molecules with various shapes or other internal degrees of freedom. However, often the key point in variational calculus and the theory of PDE is to consider non constant densities and hence non translation invariant systems. One key mathematically rigorous result in this direction was the proof of the convergence of the virial expansion by Lebowitz and Penrose in 1964 [LP64], building on the previously established convergence of the activity expansion of the pressure and of the density. The proof consists out of three main steps: first to invert the density-activity relation, second to plug the resulting expansion of the activity as a function of the density into the pressure-activity expansion and resum, and finally bound the radius of convergence of the composed power series combining convergence results for the inversions and for activity expansions. Previous results [MGM40], based on formal manipulations of power series and combinatorics of graphs, had already identified the coefficients in the density series in terms of two-connected (“irreducible”) graphs. A by-product of the convergence result from [LP64] is the absolute convergence of the generating function for two-connected graphs, thus justifying formulas that were already in use.

This recipe for going from activity expansions to density expansions extends to quantities whose activity expansion is well understood, for example, the truncated correlation functions. However convergence proofs for other quantities are more delicate, as explained in detail in [KT18] for the direct correlation functions. Indeed, even though combinatorial series for various quantities are available, their derivation rests on formal manipulations and graph re-summations that have yet to be rigorously justified. The formal graph re-summations were developed in the 60’s mainly by the works of Morita and Hiroike [MH60, MH61] and of Stell [Ste64] on liquid state theory expansions for inhomogeneous fluids, allowing for position-dependent densities. In contrast, the convergence result from [LP64] and all subsequent works addresses homogeneous systems only.

Our goal, therefore, is twofold:

(1)

Establish the validity of the inversion formulas for inhomogeneous fluids. 2. (2)

Prove the validity of re-summation operations on graphs by showing that the resulting power series are absolutely convergent.

As far as goal (2) is concerned, in a previous work [KT18] we proved convergence for some resummation of expansions leading to graphs with higher connectivity properties, but starting from the canonical ensemble. That choice was made in order to avoid the graph re-summations that come with the inversion, but also since it is more natural for expansions with respect to the density. In the current paper we prove the validity of these re-summations by inverting the density-activity relation, but a similar structure may be expected for other types of resummations as considered, e.g. inverting the truncated correlation vs activity relation. We intend to address all these issues in a subsequent work.

Concerning goal (1), since inhomogeneous system can be seen as a system of uncountably many species, when one considers the position $x\in\mathbb{R}^{d}$ as species. We consider goal (1) in this more general context of a system of (potentially uncountably many) species. In this way, we can treat at the same time as well systems of mixtures as with internal degrees of freedom. This generalization will not increase the complexity of the arguments involved.

At first sight, it may look as if goal (1) is best achieved with the help of inverse function theorems in complex Banach spaces, applied to the functional that maps the activity profile $(z(x))_{x\in\Lambda}$ to the density profile $(\rho(x))_{x\in\Lambda}$ , see Section 2.2. This works well for inhomogeneous systemsof e.g. objects of bounded size, e.g., hard spheres of fixed radius. It turns out, however, that Banach inversion fails for mixtures of objects of finite but unlimited size [JTTU14, Jan15], see Example 2.7. As a way out, mixtures of countably many species were treated with the help of Lagrange-Good inversion in [JTTU14], leaving the case of uncountably many species wide open.

Our first main result is a novel inversion theorem (Theorem 2.5) that addresses the above-mentioned difficulties and bypasses both Banach and Lagrange-Good inversion. The novelty is two-fold. First, we work on the level of formal series and relate the formal inverse to generating functions of trees or equivalently, solutions of certain formal fixed point problems (Proposition 2.6). This part is inspired by the combinatorial proof of the Lagrange-Good formula for finitely many variables given in [Ges87], we will consider this relation in more details in a forthcoming work. Second, we provide sufficient conditions for the convergence of the formal inverse, i.e., of a generalized tree generating functions (Theorem 2.3). The inversion theorem is of an abstract general nature and has the potential of being applied to other situations than the density-activity relation in statistical mechanics.

In our second group of results (Section 3), we apply the abstract inversion theorem to the concrete problem of inverting the functional that maps the activity profile in an inhomogeneous grand-canonical Gibbs measure (or even a general multi-species system) to the density profile. We exhibit domains on which the activity profile is written as a convergent series in the density profile, relate the coefficients to two-connected graphs, and show that the virial expansion for the pressure as a functional of the position-dependent density profile converges and is indeed given in terms of two-connected graphs (Theorem 3.5). These results work for general stable pair potentials.

Finally in Section 4 we apply the results to different more concrete choices of pair potentials. We demonstrate the power of our approach for systems of homogeneous hard spheres, our results yield a significant improvement over previously available bounds (Theorem 4.1). For general non-negative potentials the improvement is almost 27%. For mixtures of thin rods with different orientiations, we obtain a series representation of the (grand-canonical) free energy as a function of the overall density $\rho_{0}$ of rods and the probability density $p(\sigma)$ on different orientations (Theorem 4.7 and Corollary 4.8). In fact, in an early work, Onsager [Ons49] derived a density functional for liquid crystals, keeping track of the orientation of the atomistic elongated molecules. Working in the canonical ensemble he discretized the space of orientations and assigned each value to a species obtaining a multi-species canonical partition function for (for finitely many) species. Following the new developments [PT12], the convergence of this expansion can be easily proved to be valid in the low density regime. Our result allows for a direct treatment of continuous values of the orientation as inhomogeneous systems. It bypasses the need to estimate errors from discretizing the orientation space, at the price of a detour through the grand-canonical ensemble. The improvements we are obtain are purely due to the improved inversion results as we used the classical tree-graph bound in the grand-canonical ensemble [Pen63], [Rue69],[MM91], [PU09], [PY17] and for marked systems [Kun01].

Following the above discussion we summarize below the main outcomes of this paper:

(1)

Proof of a novel inversion theorem (Theorem 2.3), applicable to the inversion of the density-activity relation for inhomogeneous systems (Section 3), yielding a convergent power series of the inverse map. 2. (2)

Key technical tool: a fixed point equation for generating functions of special trees (Proposition 2.6). 3. (3)

Various applications: inhomogeneous gas, liquid crystals, molecules with various shapes (internal degrees of freedom), see Section 4. 4. (4)

Comparison to existing theorems of inversion in Banach spaces (Proposition 2.8 and Theorem 2.10). 5. (5)

Discussion of the improvement of the radius of convergence for the (homogeneous) hard sphere gas (Section 4.1).

2. General inversion theorems

2.1. Main inversion theorem with proof

Let $(\mathbb{X},\mathcal{X})$ be a measurable space and $\mathfrak{M}(\mathbb{X},\mathcal{X})$ the set of $\sigma$ -finite non-negative measures on $(\mathbb{X},\mathcal{X})$ . Further let $\mathfrak{M}_{\mathbb{C}}(\mathbb{X},\mathcal{X})$ be the set of complex linear combinations of measures in $\mathfrak{M}(\mathbb{X},\mathcal{X})$ . When there is no risk of confusion, we shall write $\mathfrak{M}$ and $\mathfrak{M}_{\mathbb{C}}$ for short. Suppose we are given a family of measurable functions $A_{n}:\mathbb{X}\times\mathbb{X}^{n}\to\mathbb{C}$ , $(q,(x_{1},\ldots,x_{n}))\mapsto A_{n}(q;x_{1},\ldots,x_{n})$ . We assume that each $A_{n}$ is symmetric in the $x_{j}$ ’s, i.e.,

[TABLE]

for all permutations $\sigma\in\mathfrak{S}_{n}$ . When we say that a power series converges absolutely, we mean that

[TABLE]

where $|z|$ is the total variation of $z$ 111If $z=\mu_{1}-\mu_{2}+\mathrm{i}\mu_{3}-\mathrm{i}\mu_{4}$ with $\mu_{1},\ldots,\mu_{4}$ mutually singular $\sigma$ -finite non-negative measures, then $|z|=\sum_{i=1}^{n}\mu_{i}$ . Let $\mathscr{D}(A)\subset\mathfrak{M}_{\mathbb{C}}$ be the domain of convergence of the associated power series, that is $z\in\mathscr{D}(A)$ if and only if the power series converges absolutely in the above sense. We set

[TABLE]

We are interested in maps of the form

[TABLE]

given by

[TABLE]

where $\rho(\mathrm{d}q;z)$ is just a notation for $\rho[z](\mathrm{d}q)$ . The latter is useful whenever one wants to stress the $q$ instead of the $z$ dependence. Thus $\rho[z]$ is absolutely continuous with respect to $z$ with Radon-Nikodým derivative $\exp(-A(q;z))$ . We want to determine the inverse map $\nu\mapsto\zeta[\nu]$ ,

[TABLE]

Suppose for a moment that such an inverse map exists. Clearly $z$ is equivalent to $\nu=\rho[z]$ with Radon-Nikodým derivative $\exp(A(q;z))$ . Consequently we should have

[TABLE]

This observation is the starting point for our inversion result, namely the family of power series $(T_{q}^{\circ})_{q\in\mathbb{X}}$ given by

[TABLE]

should solve

[TABLE]

and therefore

[TABLE]

In Proposition 2.6 below we provide a combinatorial interpretation of $T_{q}^{\circ}$ as the exponential generating function for colored rooted, labelled trees whose root is a ghost of color $q$ (i.e., the root does not come with powers of $\nu$ in the generating function). For our main inversion theorem, however, it is enough to know that the fixed point equation ( $\mathsf{FP}$ ) determines the power series $(T_{q}^{\circ})_{q\in\mathbb{X}}$ uniquely.

Lemma 2.1.

There exists a uniquely defined family of formal power series

[TABLE]

with $t_{n}:\mathbb{X}\times\mathbb{X}^{n}\to\mathbb{C}$ measurable and symmetric in the $x_{j}$ ’s, that solves ( $\mathsf{FP}$ ) in the sense of formal power series.

As the above expressions are interpreted in the sense of formal power series, neither the series need to converge nor the integrals need to exist.

Proof.

Set $t_{0}:=1$ . Let $B_{n}(q;x_{1},\ldots,x_{n})$ be the coefficients of the series in the exponential in ( $\mathsf{FP}$ ), i.e., each $B_{n}:\mathbb{X}\times\mathbb{X}^{n}\to\mathbb{C}$ is measurable, and we have

[TABLE]

in the sense of formal power series. Then

[TABLE]

see Eq. (A.8) in Appendix A. The third sum is over ordered partitions $(V_{j})_{j\in J}$ of $[n]\setminus J$ , indexed by $J$ , into $\#J$ disjoint sets $V_{j}$ , with $V_{j}=\varnothing$ explicitly allowed. For example,

[TABLE]

More generally, $B_{n}(q;\cdot)$ depends on $t_{1}(q;\cdot),\ldots,t_{n-1}(q;\cdot)$ alone. This is the only aspect of (2.9) that enters the proof of this lemma.

For $n\in\mathbb{N}$ , let $\mathcal{P}_{n}$ be the collection of set partitions of $\{1,\ldots,n\}$ . The family $(T_{q}^{\circ})_{q\in\mathbb{X}}$ solves ( $\mathsf{FP}$ ) in the sense of formal power series if and only if for all $n\in\mathbb{N}$ and $q,x_{1},\ldots,x_{n}\in\mathbb{X}^{n}$ , we have

[TABLE]

see Eq. (A.7) in Appendix A. In particular,

[TABLE]

which determines $t_{1}$ and $t_{2}$ uniquely. A straightforward induction over $n$ , exploiting that the right-hand side of (2.10) depends on $t_{1},\ldots,t_{n-1}$ alone (via $B_{1}$ ,…, $B_{n}$ ), shows that the system of equations (2.10) has a unique solution $(t_{n})_{n\in\mathbb{N}}$ . ∎

*Remark 2.2**.*

The proof of Lemma 2.1 shows that the coefficients $(t_{n})_{n\in\mathbb{N}}$ can be computed recursively.

Next we provide a sufficient condition for the absolute convergence of the series $T_{q}^{\circ}(\nu)$ .

Theorem 2.3.

Let $T_{q}^{\circ}(\nu)$ be the unique solution of ( $\mathsf{FP}$ ) from Lemma 2.1. Assume that for some measurable function $b:\mathbb{X}\to[0,\infty)$ , the measure $\nu\in\mathfrak{M}_{\mathbb{C}}$ satisfies, for all $q\in\mathbb{X}$ ,

[TABLE]

Then, for all $q\in\mathbb{X}$ , we have that

[TABLE]

and the fixed point equation ( $\mathsf{FP}$ ) holds true as an equality of absolutely convergent series.

Proof.

The inductive proof is similar to [Uel04, PU09]. Let $S_{q}^{N}(\nu)$ , $N\in\mathbb{N}_{0}$ , be the partial sums for the left-hand side of ( $\mathscr{M}_{b}$ ),

[TABLE]

We prove $S_{q}^{N}(\nu)\leq\mathrm{e}^{b(q)}$ by induction on $N$ , building on the proof of Lemma 2.1. The estimate for the full series then follows by a passage to the limit $N\to\infty$ .

For $N=0$ , we have $S_{q}^{0}(\nu)=1$ and the inequality $S_{q}^{0}(\nu)\leq\exp(b(q))$ is trivial. Now assume $S_{q}^{N-1}(\nu)\leq\exp(b(q))$ . The triangle inequality applied to Eqs. (2.9) and (2.10) yields the same iterative formula for $\left|t_{n}(q;x_{1},\ldots,x_{n})\right|$ as for $t_{n}(q;x_{1},\ldots,x_{n})$ just with $A_{n}(q;x_{1},\ldots,x_{n})$ replaced by $\bigl{|}A_{n}(q;x_{1},\ldots,x_{n})\bigr{|}$ . We noted before that, if we consider $S_{q}^{N}(\nu)$ and hence only $\left|t_{n}(q;x_{1},\ldots,x_{n})\right|$ for $n\leq N$ , then on the right hand side only $\left|t_{n}(q;x_{1},\ldots,x_{n})\right|$ with $n\leq N-1$ appear. However, there are some terms on the right hand side, which as well only contain $\left|t_{n}(q;x_{1},\ldots,x_{n})\right|$ with $n\leq N-1$ but which come from some term $\left|t_{n}(q;x_{1},\ldots,x_{n})\right|$ on the left hand side for $n>N$ . Adding these missing terms, we reconstruct an exponential on the right hand side. As all of these additional terms are non-negative, we get the following inequality, instead of an equality

[TABLE]

The induction is complete. It follows that ( $\mathscr{M}_{b}$ ) holds true. In particular, the series $T_{q}^{\circ}(\nu)$ is absolutely convergent and satisfies $|T_{q}^{\circ}(\nu)|\leq\exp(b(q))$ . By condition ( $\mathscr{S}_{b}$ ), the right-hand side of the fixed point equation ( $\mathsf{FP}$ ) is absolutely convergent as well. Therefore Eq. ( $\mathsf{FP}$ ) holds true not only as an identity of formal power series but in fact as an identity of well-defined complex-valued functions. ∎

*Remark 2.4**.*

For non-negative functions $A_{n}$ , the convergence estimate is sharp, in the following sense: If $\nu\in\mathfrak{M}$ is a non-negative measure and $T_{q}^{\circ}(\nu)<\infty$ , then there exists a function $b:\mathbb{X}\to[0,\infty)$ such that ( $\mathscr{M}_{b}$ ) holds true. Indeed, an induction over $n$ , based on Eqs. (2.9) and (2.10), shows that if the $A_{n}$ ’s are non-negative, then the coefficients $B_{n}$ and $t_{n}$ are non-negative as well. If $T_{q}^{\circ}(\nu)<\infty$ , we may define

[TABLE]

Notice $b(q)\geq 0$ because of $T_{q}^{\circ}(\nu)\geq 1$ for non-negative $t_{n}$ and $\nu$ . It follows from ( $\mathsf{FP}$ ) that the inequality ( $\mathscr{S}_{b}$ ) holds true and is in fact an equality. This was already noticed in [Jan18, Proposition 2.9] and the proof of Theorem 4.2(b) in [Jan15].

Now that we have addressed the convergence of the series $T_{q}^{\circ}$ , we may come back to the inversion of the map $\mathscr{D}(A)\ni z\mapsto\rho[z]$ . For measurable $b:\mathbb{X}\to[0,\infty)$ , let

[TABLE]

For $\nu\in\mathscr{V}_{b}$ , define $\zeta[\nu]\in\mathfrak{M}_{\mathbb{C}}$ by

[TABLE]

Theorem 2.5.

For every weight function $b:\mathbb{X}\to\mathbb{R}_{+}$ , there is a set $\mathscr{U}_{b}\subset\mathscr{D}(A)$ such that $\rho:\mathscr{U}_{b}\to\mathscr{V}_{b}$ is a bijection with inverse $\zeta$ .

Proof.

Let $\mathscr{U}_{b}$ be the image of $\mathscr{V}_{b}$ under $\zeta$ . By Theorem 2.3, the set $\mathscr{U}_{b}$ is contained in $\mathscr{D}(A)$ , in particular if $z=\zeta[\nu]$ with $\nu\in\mathscr{V}_{b}$ , then $\rho[z]$ is well-defined with

[TABLE]

For the last identity we have used the fixed point equation ( $\mathsf{FP}$ ). Thus we have checked that if $z=\zeta[\nu]$ , with $\nu\in\mathscr{V}_{b}$ , then $\rho[z]=\nu$ . Conversely, if $\nu=\rho[z]$ with $z\in\mathscr{U}_{b}$ , then by definition of $\mathscr{U}_{b}$ there exists $\mu\in\mathscr{V}_{b}$ such that $z=\zeta[\mu]$ , hence $\nu=\rho[z]=\rho[\zeta[\mu]]=\mu\in\mathscr{V}_{b}$ and $z=\zeta[\mu]=\zeta[\nu]$ . ∎

Finally we provide a combinatorial formula for the function $T_{q}^{\circ}(\nu)$ appearing in the inverse $\zeta[\nu]$ . Consider a genealogical tree that keeps track not only of mother-child relations, but also of groups of siblings born at the same time. This results in a tree for which children of a vertex are partitioned into cliques (singletons, twins, triplets, etc.). Accordingly for $n\in\mathbb{N}$ we define $\mathcal{TP}_{n}^{\circ}$ as the set of pairs $(T,(P_{i})_{0\leq i\leq n})$ consisting of:

•

A tree $T$ with vertex set $[n]:=\{0,1,\ldots,n\}$ . The tree is considered rooted in [math] (the ancestor).

•

For each vertex $i\in\{0,1,\ldots,n\}$ , a set partition $P_{i}$ of the set of children222The members of the partition are assumed to be non-empty, except we consider the partition of the empty set. of $i$ . If $i$ is a leaf (has no children), then we set $P_{i}=\varnothing$ .

For $x_{0},\ldots,x_{n}\in\mathbb{X}$ , we define the weight of an enriched tree $(T,(P_{i})_{0\leq i\leq n})\in\mathcal{TP}_{n}^{\circ}$ as

[TABLE]

with $\prod_{J\in\varnothing}=1$ . So the weight of an enriched tree is a product over all cliques of twins, triplets, etc., contributing each a weight that depends on the variables $x_{j}$ of the clique members and the variable $x_{i}$ of the parent.

Proposition 2.6.

The family of power series $(T_{q}^{\circ})_{q\in\mathbb{X}}$ from Lemma 2.1 is given by

[TABLE]

Proof.

We check that the generating function of the weighted enriched trees satisfies ( $\mathsf{FP}$ ). Functional equations for generating functions of labelled trees are standard knowledge [BLL98], we provide a self-contained proof for the reader’s convenience. Define

[TABLE]

Further define $\tilde{B}_{n}(q;x_{1},\ldots,x_{n})$ but restricting the sum to enriched trees for which $\#P_{0}=1$ (all children of the root belong to the same clique). Further set $t_{0}=1$ and $\tilde{B}_{0}=0$ . For $V\subset\mathbb{N}$ a finite non-empty set, define $\mathcal{TP}^{\circ}(V)$ in the same way as $\mathcal{TP}^{\circ}_{n}$ but with $\{0,1,\ldots,n\}$ replaced by $\{0\}\cup V$ . For $V=\varnothing$ we define $\mathcal{TP}^{\circ}(V)=\varnothing$ and assign the empty tree the weight $1$ . For non-empty trees, weights $w(R;(x_{j})_{j\in V\cup\{0\}})$ are defined in complete analogy with (2.13).

Clearly there is a bijection between enriched trees $R\in\mathcal{TP}_{n}^{\circ}$ and set partitions $\{J_{1},\ldots,J_{m}\}$ of $[n]:=\{1,\ldots,n\}$ together with enriched trees $R_{i}\in\mathcal{TP}^{\circ}(J_{i})$ , $i=1,\ldots,m$ for which all children of the root are in the same clique. Indeed, the number $m$ corresponds to the number of cliques in which the children of the root are divided and the blocks $J_{1},\ldots,J_{m}$ group descendants of the root, where $J_{k}$ contains the children of the root which are in the $k$ -th. clique and all their decedents. The weight of an enriched tree $R$ is equal to the product of the weights of the subtrees $R_{i}$ . Therefore

[TABLE]

Furthermore there is a one-to-one correspondence between, on the one hand, enriched trees where all the children of the root are in the same clique and on the other hand tuples $(J,(V_{j})_{j\in J},(R_{j})_{j\in J})$ consisting of non-empty set $J\subset[n]$ , an ordered partition $(V_{j})_{j\in J}$ of $[n]\setminus J$ (with $V_{j}=\varnothing$ allowed), and a collection of enriched trees $R_{j}\in\mathcal{TP}^{\circ}(V_{j})$ . Overall, $J$ and $(V_{j})_{j\in J}$ give a partition of $[n]$ . The set $J$ consists of the labels of the children of the root, that is the one clique which all these children form and for each $j\in J$ , the set $V_{j}$ consists of the labels of the descendants of $j$ . ( $V_{j}=\varnothing$ means that $j$ is a leave of the tree) It follows that

[TABLE]

It follows from Eqs. (2.14) and (2.15) that the formal power series with coefficients $\tilde{t}_{n}$ solves ( $\mathsf{FP}$ ), therefore Lemma 2.1 yields $\tilde{t}_{n}=t_{n}$ . ∎

2.2. Scale of Banach spaces. Banach inversion

Formally, one is tempted to say that $\rho[z]$ is given by a power series with leading order $z$ , hence differentiable with derivative at the origin given by the identity matrix; therefore the existence and regularity of the inverse map should follow from some general inverse function theorem. When $\mathbb{X}$ is finite so that $z$ can be identified with a finite vector $(z_{x})_{x\in\mathbb{X}}\in\mathbb{C}^{n}$ , with $n=\#\mathbb{X}$ , this can be implemented and is indeed a standard ingredient for the virial expansion for single-species systems [LP64].

For infinite spaces $\mathbb{X}$ one may try a Banach inversion theorem. This works in some cases (see Theorem 2.10 below), but there are situations where the Banach inversion theorem is doomed to fail, as illustrated by the following example. The example is inspired by concrete features of the multi-species Tonks model [Jan15] for rods of unbounded lengths $\ell_{k}=k$ .

*Example 2.7**.*

Let $\mathbb{X}=\mathbb{N}$ and identify measures on $\mathbb{X}$ with sequences $(z_{k})_{k\in\mathbb{N}}$ . Consider the map $(z_{k})\mapsto(\rho_{k})$ given by

[TABLE]

Let $\ell^{\infty}(\mathbb{N})$ be the space of bounded complex-valued sequences equipped with the supremum norm and $X_{c}$ the space of sequences $(\nu_{k})$ with $||\nu||_{c}:=\sup_{k\in\mathbb{N}}|\nu_{k}|\exp(-ck)<\infty$ , for some fixed scalar $c>0$ . We may view $(z_{k})\mapsto(\rho_{k})$ as a map from the open ball $B(0,c)\subset\ell^{\infty}(\mathbb{N})$ to $X_{c}$ . The derivative $\mathrm{D}\rho(0)$ is the identity map or more precisely, the embedding $\iota:\ell^{\infty}(\mathbb{N})\to X_{c}$ , $\iota(h):=h$ . It is injective and continuous but it does not have a continuous inverse, therefore Banach inversion theorems are not applicable. The issue arises because the norms $||\cdot||_{\infty}$ and $||\cdot||_{c}$ are not equivalent. A target space with inequivalent norm is needed because, for every $z_{1}<0$ —no matter how small— $|\rho_{k}|\gg|z_{k}|$ as $k\to\infty$ .

It turns out that the natural analytic framework for our inversion theorem uses not a single Banach space, but instead a scale of Banach spaces, as is the case for the Nash-Moser theorem [Ham82, Sec16]. We explain this aspect in more detail here as this clarifies the issues raised in [JTTU14, Section 2.2] and [Jan15, Theorem 2.8].

Let us fix a reference measure $m\in\mathfrak{M}(\mathbb{X},\mathcal{X})$ and we restrict to measures that are absolutely continuous with respect to $m$ . Remember that $\rho[z](\mathrm{d}x)$ is absolutely continuous with respect to the measure $z(\mathrm{d}x)$ , so if $z$ is absolutely continuous with respect to $m$ , then so is $\rho[z]$ . We work with the Radon-Nikodým derivatives rather than the measures and write

[TABLE]

similarly for $\nu$ and $\zeta$ . Fix a weight function $b:\mathbb{X}\to\mathbb{R}_{+}$ and assume that $m$ satisfies condition ( $\mathscr{S}_{b}$ ). Let $L^{\infty}(\mathbb{X},m)$ be the space of bounded functions (precisely, equivalence classes up to $m$ -null sets), equipped with the supremum norm

[TABLE]

Write $B_{r}(0)$ for open balls of radius $r$ centered at [math]. For $h:\mathbb{X}\to\mathbb{C}$ measurable and $k\in\mathbb{Z}$ , define the weighted supremum norm

[TABLE]

and let $Y_{kb}$ be the associated Banach space. Notice the inclusions

[TABLE]

When $b$ is essentially bounded, then the inclusions are equalities and the norms $||\cdot||_{kb}$ , $||\cdot||_{\infty}$ are equivalent. For $||b||_{\infty}=\infty$ , the inclusions are strict and the norms are inequivalent. Let $B(0,r)$ and $B_{kb}(0,r)$ be the open balls of radius $r$ , centered at the origin, in $L^{\infty}(\mathbb{X},m)$ and $Y_{kb}$ , respectively.

Proposition 2.8.

Assume that $m\in\mathfrak{M}$ satisfies condition ( $\mathscr{S}_{b}$ ). Then the maps

[TABLE]

are holomorphic, as maps between the Banach spaces $Y_{kb}$ and $Y_{(k-1)b}$ . Moreover we have $\rho[\zeta[\nu]]=\nu$ and $\zeta[\rho[z]]=z$ for all $\nu\in B(0,1)$ and $z\in B_{-b}(0,1)$ .

The proposition is proven at the end of this section. The inclusions $\rho[B_{kb}(0,1)]\subset B_{(k-1)b}(0,1)$ and $\zeta[B_{kb}(0,1)]\subset B_{(k-1)b}(0,1)$ follow from the inequalities

[TABLE]

valid for all $z\in\overline{B_{-b}(0,1)}$ , $\nu\in\overline{B(0,1)}$ , and all $q\in\mathbb{X}$ , assuming $m$ satisfies ( $\mathscr{S}_{b}$ ) by using Theorem 2.3. The difference to the previous results is that we show here uniform convergence of the power series expansions of $\rho$ and $\zeta$ in the relevant norms.

We briefly check (2.16). If $z\in\overline{B_{-b}(0,1)}$ then $|z(q)|\leq||z\mathrm{e}^{-b}||_{\infty}\mathrm{e}^{b(q)}\leq\mathrm{e}^{b(q)}$ for $m$ -almost all $q$ . Since $m$ satisfies condition ( $\mathscr{S}_{b}$ ), it follows by Theorem 2.3 that the measure $z(\mathrm{d}q)=z(q)m(\mathrm{d}q)$ is in the domain of convergence $\mathscr{D}(A)$ of $A$ (though it does not fulfill condition ( $\mathscr{S}_{b}$ )) and $|A(q;z)|\leq b(q)$ , consequently $|\rho(q;z)|\leq\mathrm{e}^{b(q)}|z(q)|$ . If $\nu\in\overline{B(0,1)}$ , then, using again that $m$ satisfies condition ( $\mathscr{S}_{b}$ ), we see that in this case the measure $\nu(\mathrm{d}q)=\nu(q)m(\mathrm{d}q)$ satisfies condition ( $\mathscr{S}_{b}$ ) as well and the bound ( $\mathscr{M}_{b}$ ) yields $|\zeta(q;\nu)|=|\nu(q)T_{q}^{\circ}(\nu)|\leq|\nu(q)|\mathrm{e}^{b(q)}$ .

It is an immediate consequence of Proposition 2.8 that $\rho$ is a bijection from $\mathcal{U}_{b}:=\zeta[B(0,1)]\subset B_{-b}(0,1)$ onto $B(0,1)$ . If $b$ is essentially bounded, then all norms are equivalent, hence $\rho$ and $\zeta$ are holomorphic as maps in $L^{\infty}(\mathbb{X},m)$ and $\mathcal{U}_{b}=\rho^{-1}(B(0,1))$ is open in the non-weighted sup norm $||\cdot||_{\infty}$ . Moreover we have the inclusion

[TABLE]

and we obtain the following corollary.

Corollary 2.9.

Assume that $m\in\mathfrak{M}$ satisfies condition ( $\mathscr{S}_{b}$ ) and in addition $||b||_{\infty}<\infty$ . Then $\rho[\cdot]$ maps some open subset $\mathcal{U}_{b}$ of $B(0,\mathrm{e}^{||b||_{\infty}})\subset L^{\infty}(\mathbb{X},m)$ biholomorphically onto $B(0,1)$ , and the inverse map is $\zeta$ .

Corollary 2.9 points out a situation where Banach inversion does work, which raises the question whether a similar result can be obtained directly, bypassing the introduction of a weight function $b$ . This is indeed possible. Let us fix a reference measure $m$ as before but drop the requirement that $m$ satisfies ( $\mathscr{S}_{b}$ ). Set

[TABLE]

and let

[TABLE]

Theorem 2.10 (Banach inversion).

Assume that (2.17) holds true for some $r>0$ and let $R>0$ be as in (2.18). Let

[TABLE]

Then the functional $\rho$ maps some open neighborhood of the origin $\mathcal{O}\subset B(0,R)\subset L^{\infty}(\mathbb{X},m)$ biholomorphically onto the open ball $B(0,P)$ .

Proof.

The map $\rho:B(0,R)\to L^{\infty}(\mathbb{X},m)$ is holomorphic. The proof of the holomorphicity is similar to the proof of Proposition 2.8 and therefore omitted. The derivative at the origin is the identity: $\mathrm{D}\rho(0)=\mathrm{id}$ . On $B(0,r)\subset B(0,R)$ , the map is bounded by $r\exp(M(r))$ . Therefore, by Theorem B.6, for each $r\in(0,R)$ , the functional $\rho$ maps the open ball $B(0,\frac{1}{4}r\mathrm{e}^{-M(r)})\subset L^{\infty}(\mathbb{X},m)$ biholomorphically onto a domain covering $B(0,\frac{1}{8}r\mathrm{e}^{-M(r)})$ . We optimize over $r$ and obtain the theorem. ∎

*Remark 2.11**.*

Let us compare the radius of convergence of the inverse function $P$ which we obtained with the technique of Banach inversion theorem with the convergence results we obtain with the new inversion technique described in Section 2, namely Corollary 2.9. Let us call $P^{\prime}$ the radius of convergence in the latter case. We will show that $P^{\prime}=8P$ and thus even in those situations where a direct application of Theorem B.6 is possible, it yields a bound that is worse than ours.

Let us first derive an expression of $P^{\prime}$ in terms of $M$ as defined in (2.17). If $m$ satisfies condition ( $\mathscr{S}_{b}$ ) with $||b||_{\infty}<\infty$ , then $M(1)\leq||b||_{\infty}<\infty$ . Conversely, assume $M(s)<\infty$ for some $s>0$ and consider constant weight functions $b(q)\equiv b>0$ . Then, for every $b>0$ , choosing $s>0$ small enough we may assume $M(s\mathrm{e}^{b})\leq b$ and then the rescaled measure $sm$ satisfies condition ( $\mathscr{S}_{b}$ ). Noting that

[TABLE]

we deduce from Corollary 2.9 that $B(0,s)$ is contained in the domain of convergence of the density expansions. An optimization over $b$ and $s$ shows that the domain of convergence contains the open ball $B(0,P^{\prime})$ with radius

[TABLE]

Below we check that $P^{\prime}=8P$ .

Proof of $P^{\prime}=8P$ .

Let $\varepsilon>0$ and $s\geq P^{\prime}-\varepsilon$ . By definition of $P^{\prime}$ , there exists $b>0$ such that $M(s\mathrm{e}^{b})\leq b$ . Set $r:=s\mathrm{e}^{b}$ . Then $M(r)\leq b<\infty$ , thus $r\leq R$ and

[TABLE]

It follows that $8P\geq P^{\prime}$ . Conversely, let $s\geq 8P-\varepsilon$ . By definition of $P$ there exists $r\in(0,R)$ such that $s\leq r\exp(-M(r))$ , hence $1\leq\exp(M(r))\leq\frac{r}{s}$ . Set $b:=\log\frac{r}{s}$ , then $b\geq 0$ , $r=s\mathrm{e}^{b}$ , and

[TABLE]

It follows that $P^{\prime}\geq s\geq 8P-\varepsilon$ . We let $\varepsilon\searrow 0$ and deduce $P^{\prime}=8P$ . ∎

Proof of Proposition 2.8.

We only need to prove that the maps are holomorphic. Consider first the map $\rho$ . We have $\rho(q;z)=z(q)\mathcal{E}(q;z)$ with

[TABLE]

and

[TABLE]

see Appendix A, Eq. (A.7). We show first that $\mathcal{E}:B_{-b}(0,1)\to Y_{-b}$ is holomorphic, by proving that the series (2.19) converges uniformly in the relevant operator norms. Set

[TABLE]

Then for all $r\in[0,1]$ , we have

[TABLE]

because $m$ satisfies condition ( $\mathscr{S}_{b}$ ). In particular, the power series $r\mapsto M_{0}(q;r)$ has radius of convergence $R\geq 1$ . It follows from Cauchy’s inequality for the Taylor coefficients of the series that for all $n\in\mathbb{N}$ ,

[TABLE]

Therefore, we can bound

[TABLE]

As a consequence, the map $P_{n}$ defined on $Y_{-b}$ given by

[TABLE]

satisfies for any $s\in\mathbb{R}$ (we will choose $s$ appropriately at the end)

[TABLE]

It follows from the polarization formulas, see e.g. [Muj06], that the associated multilinear map from $Y_{-b}^{n}$ to $Y_{sb}$ is bounded, whenever $s\leq-1$ or $||b||_{\infty}<\infty$ , hence $P_{n}$ is a continuous $n$ -homogeneous polynomial (see Definition B.1). By (2.21), the series $\mathcal{E}[z]=\sum_{n=1}^{\infty}P_{n}[z]$ converges uniformly in $||\mathrm{e}^{-b}z||_{\infty}\leq 1$ . Therefore, the map $z\mapsto\mathcal{E}[z]$ as a map

[TABLE]

is holomorphic. For $k\geq-1$ , it is also holomorphic as a map

[TABLE]

because $Y_{kb}\subset Y_{-b}$ and $||z\mathrm{e}^{-b}||_{\infty}\leq||z\mathrm{e}^{kb}||_{\infty}$ .

Now we return to $\rho(q;z)=z(q)\mathcal{E}(q;z)$ . By (2.20), we have

[TABLE]

hence

[TABLE]

whenever $s+1\leq k$ and $||z\mathrm{e}^{kb}||_{\infty}\leq 1$ . In order to prove the differentiability, let us introduce

[TABLE]

which will be shown to be the derivative of $\rho[z]$ . Using Cauchy’s inequality we get from the holomorphicity of $\mathcal{E}(z)$ that there exists a $C>0$ with $||\mathrm{e}^{rb}\mathrm{D}\mathcal{E}(z)h||_{\infty}\leq C||h\mathrm{e}^{-b}||_{\infty}\leq C||h\mathrm{e}^{kb}||_{\infty}$ , whenever $r\leq-1$ . Then for $s-k\leq-1$ we get that

[TABLE]

Thus $L_{z}:Y_{kb}\to Y_{sb}$ is bounded. Let us show differentiability directly. Write

[TABLE]

which can be estimated as

[TABLE]

Hence $\rho$ is holomorphic on $||z\mathrm{e}^{kb}||_{\infty}<1$ with values in $Y_{sb}$ for $s+1\leq k$ . Furthermore, $\rho$ is bounded by $1$ because of (2.24). The result is the extremal case $s=k-1$ .

The map $\zeta$ is treated in a completely analogous way. We start from $\zeta(q)=\nu(q)T_{q}^{\circ}(\nu)$ . Since we assume that $m$ satisfies condition ( $\mathscr{S}_{b}$ ), we know from Theorem 2.3 that

[TABLE]

We can now repeat the reasoning for $\rho[z]$ , substituting $\nu$ for $z$ , $T_{q}^{\circ}(q;\nu)$ for $E(q;z)$ , and the bound (2.25) for (2.20). ∎

2.3. An equivalent fixed point equation

In the proof of Lemma 3.9 in Section 3 we need another characterization of the coefficients $t_{n}(q;x_{1},\ldots,x_{n})$ .

Lemma 2.12.

The family $(T_{q}^{\circ})_{q\in\mathbb{X}}$ from Lemma 2.1 is the unique family of formal power series that solves

[TABLE]

Eq. ( $\mathsf{FP}^{\prime}$ ) reflects that $T_{q}^{\circ}(\rho[z])=\exp(A(q;z))$ while the fixed point equation ( $\mathsf{FP}$ ), defining $(T_{q}^{\circ})_{q\in\mathbb{X}}$ , reflects that $T_{q}^{\circ}(\nu)=\exp(A(q;\nu T_{q}^{\circ}(\nu)))$ because $\rho(\xi(\nu))=\nu$ .

Proof.

Let us write $\tilde{t}_{n}$ instead of $t_{n}$ as long as we do not know that the family from Lemma 2.1 satisfies ( $\mathsf{FP}^{\prime}$ ). For the existence and uniqueness of a solution $(\tilde{T}_{q}^{\circ})_{q\in\mathbb{X}}$ to ( $\mathsf{FP}^{\prime}$ ), we note that Eq. ( $\mathsf{FP}^{\prime}$ ) translates into a triangular system of equations for the coefficients $\tilde{t}_{n}$ . The details are similar to the proof of Lemma 2.1 and therefore omitted.

We start from ( $\mathsf{FP}^{\prime}$ ), written for $\tilde{t}_{n}$ ’s instead of $t_{n}$ ’s, and insert $z(\mathrm{d}q)=\nu(\mathrm{d}q)T_{q}^{\circ}(\nu)$ on both sides. This insertion corresponds precisely to the second notion of composition discussed in Appendix A, see Eq. (A.8), and in particular it is a well-defined operation on formal power series. The composition yields two formal power series in $\nu$ , one for the left and one for the right side, called $L$ and $R$ respectively, and of course we must have $L(q;\nu)=R(q;\nu)$ . On the right side we get, by ( $\mathsf{FP}$ ),

[TABLE]

On the left side we have

[TABLE]

The product inside the integral is equal to $1$ because of ( $\mathsf{FP}$ ), therefore $L(q;\nu)=\tilde{T}_{q}^{\circ}(\nu)$ and we conclude from $L=R$ that $\tilde{T}_{q}^{\circ}(\nu)=T_{q}^{\circ}(\nu)$ . In particular, $(T_{q}^{\circ})_{q\in\mathbb{X}}$ solves ( $\mathsf{FP}^{\prime}$ ). ∎

3. Virial expansion. Density functional

In this section we are consider functions $A$ of a special form, cf. (3.8) below, which are coming from a system of objects interacting via a pair potential.

Let $V:\mathbb{X}\times\mathbb{X}\to\mathbb{R}\cup\{\infty\}$ be a measurable pair potential ( $V(x,y)=V(y,x)$ ). We assume that for some measurable function $B:\mathbb{X}\to[0,\infty)$ , we have the stability condition

[TABLE]

for all $n\geq 2$ and $x_{1},\ldots,x_{n}\in\mathbb{X}$ . In addition, we also assume that for all $x\in\mathbb{X}$ and some function $B^{*}:\mathbb{X}\to\mathbb{R}_{+}$ we have

[TABLE]

Define

[TABLE]

for $n\geq 2$ and $H_{0}=0$ , $H_{1}=0$ . Let us introduce for the next few calculation up to (3.6) an extra assumption on $z\in\mathfrak{M}_{\mathbb{C}}(\mathbb{X},\mathcal{X})$ , namely

[TABLE]

In the case that $\mathbb{X}\subset\mathbb{R}^{d}$ and $V$ , $z$ respectively, is a translation invariant function, measure respectively, then the above condition means that the volume of $\mathbb{X}$ with respect to the Lebesgue measure is finite. Hence we say that we are in the “finite volume” case. We will point out which formulas also hold in the “infinite volume” case.

The grand-canonical partition function at activity $z$ and inverse temperature $\beta>0$ is

[TABLE]

Condition (3.3) ensures that $\Xi(\beta,z)$ is finite. The one-particle density is

[TABLE]

Notice

[TABLE]

see Eqs. (A.4) and (A.5) in Appendix A applied to $\log\left(1+\left(\Xi(\beta,z)-1\right)\right)$ . We bring the expression for $\rho$ into the form (2.5). This allows us to extend the definition (3.5) to activities that do not satisfy the finite-volume condition (3.3). Set

[TABLE]

Let $\mathcal{C}_{n}$ be the set of connected graphs $g$ with vertex set $[n]=\{1,\dots,n\}$ , and $E(g)$ the edge set of a graph $g=([n],E(g))$ and

[TABLE]

The aim of the section is to use the result of the previous section for this particular $A_{n}$ . Furthermore, define the well-known Ursell functions

[TABLE]

Let us recall some known results.

Lemma 3.1.

Let $A_{n}(q;x_{1},\ldots,x_{n})$ be as in (3.8) and define $A(q;z)$ as in (2.3). Let $z\in\mathfrak{M}_{\mathbb{C}}$ satisfy only

[TABLE]

for some weight function $a:\mathbb{X}\to\mathbb{R}_{+}$ and all $x\in\mathbb{X}$ . Then $z$ is in the domain of convergence $\mathscr{D}(A)$ .

If in addition $z$ satisfies the finite-volume condition (3.3), then the density $\rho(\mathrm{d}q;z)$ defined in (3.5) is equal to $\exp(-A(q;z))z(\mathrm{d}q)$ , moreover

[TABLE]

with absolutely convergent integrals and series.

The lemma follows from the tree-graph inequality due to [PY17] and additional combinatorial considerations, compare for example [JTTU14, Eq. (4.17)]. The details are similar to aspects of the proof of Lemma 3.7 and therefore omitted.

Definition 3.2.

For activities $z$ that satisfy (3.10) but not necessarily the condition (3.3), we adopt the equality $\rho(\mathrm{d}q;z)=z(\mathrm{d}q)\exp(-A(q;z))$ as the definition of the density.

*Remark 3.3** (Physical interpretation of $A(q;z)$ ).*

Let $W(q;x_{1},\ldots,x_{n}):=\sum_{i=1}^{n}V(q,x_{i})$ be the total interaction of a particle at $q$ with the particles $x_{1},\ldots,x_{n}$ . By (3.5) and Lemma 3.1, we have

[TABLE]

where $\langle\cdot\rangle$ denotes the expectation with respect to the grand-canonical Gibbs measure. Thus $\frac{1}{\beta}A(q;z)$ is the excess free energy for a test particle pinned at the location $q$ .

Let $\mathcal{B}_{n}\subset\mathcal{C}_{n}$ be the set of bi-connected graphs, i.e., graphs that stay connected upon removal of a single vertex. Define

[TABLE]

We want to invert the map $z\mapsto\rho[z]$ and express the inverse with bi-connected graphs. Before that we derive a convergent result for power series with coefficients given by bi-connected graphs.

Theorem 3.4.

Let $\nu\in\mathfrak{M}_{\mathbb{C}}$ . Suppose there exist functions $a,b:\mathbb{X}\to\mathbb{R}_{+}$ with $a\leq b$ on $\mathbb{X}$ such that

[TABLE]

for all $x\in\mathbb{X}$ . Then

[TABLE]

for all $q\in\mathbb{X}$ .

Define $\mathsf{V}_{b}$ by

[TABLE]

Theorem 3.5.

There is a set $\mathsf{U}_{b}\subset\mathscr{D}(A)\subset\mathfrak{M}_{\mathbb{C}}$ such that $z\mapsto\rho[z]$ is a bijection from $\mathsf{U}_{b}$ onto $\mathsf{V}_{b}$ , and for every $z\in\mathsf{U}_{b}$ , $\nu\in\mathsf{V}_{b}$ , we have $\rho[z]=\nu$ if and only if

[TABLE]

where the latter converge in the sense that ( $\mathsf{M}_{b}$ ) holds.

If $z\in\mathfrak{M}_{\mathbb{C}}$ fulfills ( $\mathsf{S}_{a,b}$ ) for some $a\leq b$ and $\mathrm{e}^{a}|z|\in\mathsf{V}_{b}$ for the same functions $a$ and $b$ , then $\rho[z]\in\mathsf{V}_{b}$ and hence $z\in\mathsf{U}_{b}$ .

If instead the following conditions including also a “finite volume condition” holds,

[TABLE]

then also

[TABLE]

The condition $\mathrm{e}^{a}z\in\mathsf{V}_{b}$ is a condition directly in terms of $z$ which is sufficient to guarantee that $z\in\mathsf{U}_{b}$ . Recall that $\mathsf{U}_{b}$ was just defined indirectly as the image of $\zeta$ .

Formula (3.14) does not make any sense in the “infinite volume case” even if we consider the translation invariant case as discussed below (3.3). In this case, though, the right hand side is proportional to the volume of $\mathbb{X}$ , up to boundary errors. Hence, $\log\Xi(\beta,z)$ divided by the volume has a well defined limit.

For the definition of the free energy, we fix a reference measure $m(\mathrm{d}x)$ on $\mathbb{X}$ (for example, the Lebesgue measure on $\mathbb{R}^{d}$ ). The (grand-canonical) free energy $\mathcal{F}_{\mathrm{GC}}[\nu]$ of a given density profile $\nu\in\mathfrak{M}$ is defined via the Legendre transform of $\log\Xi(z)$ as

[TABLE]

with $\frac{\mathrm{d}z}{\mathrm{d}m}$ the Radon-Nikodým derivative of $z$ with respect to the reference measure $m$ . The supremum in (3.15) is over all non-negative measures $z\in\mathfrak{M}$ that are absolutely continuous with respect to $m$ and such that the integral with the logarithm is absolutely convergent.

Theorem 3.6.

Assume that $\nu\in\mathsf{V}_{b}\cap\mathfrak{M}$ is absolutely continuous with respect to $m$ and satisfies

[TABLE]

then

[TABLE]

with absolutely convergent integrals and sum.

Let us first check that condition ( $\mathsf{S}_{a,b}$ ) is sufficient for the convergence of $A(q;z)$ .

Lemma 3.7.

If $\nu$ satisfies condition ( $\mathsf{S}_{a,b}$ ) for some $a,b:\mathbb{X}\to\mathbb{R}_{+}$ with $a\leq b$ , then $\nu$ satisfies $|A(q;\nu)|\leq a(q)$ and in particular condition ( $\mathscr{S}_{b}$ ), where $A$ is defined as in (2.3) with $A_{n}$ given by (3.8).

Proof.

Set

[TABLE]

The first factor in (3.8) can be bounded as follows

[TABLE]

Indeed, this follows by induction in $n$ using

[TABLE]

and using that $|e^{-u}-1|\leq e^{\max\{-u,0\}}(1-e^{-|u|})$ . Using this bound, we get

[TABLE]

In order to bound $\mathcal{R}(q;\mathrm{e}^{\beta B^{*}+b}\nu)$ , we use a recent tree-graph inequality due to Procacci and Yuhjtman [PY17] in the form presented in [Uel17]. Then

[TABLE]

with $\mathcal{T}_{n}\subset\mathcal{C}_{n}$ the set of trees with vertex set $[n]$ . As a consequence, if a non-negative measure $\mu$ satisfies

[TABLE]

for all $q\in\mathbb{X}$ , then

[TABLE]

The inductive proof of (3.21) is similar to the proof of [PU09, Theorem 2.1] and therefore omitted. Condition ( $\mathsf{S}_{a,b}$ ) implies that $\mu:=\exp(\beta B^{*}+b)|\nu|$ satisfies

[TABLE]

Hence (3.20) and (3.21) hold true, and we can further bound (3.19) by

[TABLE]

which completes the proof. ∎

Next let us relate the coefficients of $A(q;z)$ with bi-connected graphs.

Lemma 3.8.

The formal power series $A(q;z)$ with coefficients (3.8) satisfies

[TABLE]

Proof.

The lemma follows from well-known identities for connected and bi-connected graphs, see for example [Ste64, Ler04, Far12, MH61], we sketch the argument for the reader’s convenience. If $J\subset\mathbb{N}$ is a finite non-empty set, consider the following classes of graphs with vertex set $J\cup\{0\}$ :

•

$\mathcal{C}^{\circ}(J)$ , the connected graphs on $J\cup\{0\}$ ;

•

$\mathcal{B}^{\circ}(J)$ , the biconnected graphs on $J\cup\{0\}$ ;

•

$\mathcal{A}^{\circ}(J)$ , the connected graphs that stay connected when removing [math] and the incident edges (equivalently, the connected graphs for which [math] is not an articulation point).

If $g$ is a graph with vertex set $J\cup\{0\}$ , define $w(g;(x_{i})_{i\in J\cup\{0\}})=\prod_{\{i,j\}\in E(g)}f(x_{i},x_{j})$ . Then

[TABLE]

In view of (A.7), setting $x_{0}=q$ , the coefficients of $\exp(-A(q;z))$ are given by

[TABLE]

By Eq. (A.8), the right-hand side of (3.22) is a power series $F(q;z)$ with coefficients

[TABLE]

Eq. (3.24) allows us to rewrite $F_{n}(q;x_{1},\ldots,x_{n})$ as a sum over tuples $(m,g_{0},g_{1},\ldots,g_{m})$ consisting of an integer $m\in\{1,\ldots,n\}$ and graphs $g_{0}\in\mathcal{B}^{\circ}(L)$ , $g_{\ell}\in\mathcal{C}^{\circ}(J_{\ell})$ where $\#L=m$ and $L,J_{1},\ldots,J_{\ell}$ form a partition of $[n]$ with $J_{\ell}=\varnothing$ allowed. Given such a tuple $(m,g_{0},g_{1},\ldots,g_{m})$ , a new graph $g$ is defined by gluing each $g_{\ell}$ to $g_{0}$ at the vertex $\ell$ (the vertex $\ell$ is identified with root [math] of $g_{\ell}$ ). Precisely, $\{i,j\}$ is an edge of $g$ if and only if:

•

either $i,j\in L$ and $\{i,j\}\in E(g_{0})$ ,

•

or for some $\ell\in L$ we have $i,j\in J_{\ell}$ and $\{i,j\}\in E(g_{\ell})$ ,

•

or for some $\ell\in L$ we have $i=\ell$ and $j\in J_{\ell}$ (or vice-versa) and $\{0,j\}\in E(g_{\ell})$ .

In the new graph $g$ , each of the vertices $\ell\in L$ is an articulation point (that is upon the removal of $\ell$ and the edges incident to $\ell$ the graph $g$ has a connect component which does not contain [math]. However, note that there can be other articulation points inside the $J_{\ell}$ ’s!), and the support $J_{\ell}$ of the graph $g_{\ell}$ consists of those vertices $j\in[n]$ for which every path connecting $j$ to [math] has to pass through $\ell$ . The weight of the new graph is equal to the product of the weights of the $g_{\ell}$ ’s.

The rule $(m,g_{1},\ldots,g_{m})\mapsto g$ defines a one-to-one correspondence between the tuples under consideration and graphs $g\in\mathcal{A}^{\circ}([n])$ , and the weights are multiplicative. One deduces that $F_{n}(q;x_{1},\ldots,x_{n})$ is given by a sum over graphs $g\in\mathcal{A}^{\circ}([n])$ and weights as in (3.23), therefore (3.22) holds true. ∎

As a consequence we can identify the coefficients of $T_{q}^{\circ}(\nu)$ .

Lemma 3.9.

For $A_{n}(q;x_{1},\ldots,x_{n})$ given by (3.8), the family $(T_{q}^{\circ})_{q\in\mathbb{X}}$ from Lemma 2.1 is given by

[TABLE]

Proof.

Lemma 3.8 yields

[TABLE]

Hence the right-hand side of (3.25) solves the fixed point equation ( $\mathsf{FP}^{\prime}$ ) as considered in Lemma 2.12,furthermore the lemmas yields that, as the solution of the fixed point equation, the right hand side must be equal to the family $(T_{q}^{\circ})_{q\in\mathbb{X}}$ from Lemma 2.1. ∎

Proof of Theorem 3.4.

If $\nu$ satisfies ( $\mathsf{S}_{a,b}$ ), then by Lemma 3.7 it also satisfies ( $\mathscr{S}_{b}$ ). However, by Theorem 2.3, it follows that ( $\mathscr{M}_{b}$ ) holds true as well, in particular $T_{q}^{\circ}(\nu)$ is absolutely convergent and $|T_{q}^{\circ}(\nu)|\leq\exp(b(q))$ . Combining Eqs. (3.25) and ( $\mathsf{FP}$ ) we get

[TABLE]

as formal power series, that means, that the coefficients of the series coincides. If we take the absolute value of the coefficients and we reconstruct the right hand side of the above equality one gets that

[TABLE]

where we define

[TABLE]

The right-hand side is bounded by $b(q)$ because of ( $\mathscr{M}_{b}$ ) and ( $\mathscr{S}_{b}$ ). ∎

Proof of Theorem 3.5.

Let $\zeta[\nu](\mathrm{d}q)=\zeta(\mathrm{d}q;\nu)=\nu(\mathrm{d}q)T_{q}^{\circ}(\nu)$ as in (2.12). Set $\mathsf{U}_{b}:=\zeta[\mathsf{V}_{b}]$ . By Lemma 3.7, we know that $\mathsf{V}_{b}\subset\mathscr{V}_{b}$ hence Theorem 2.5 guarantees $\mathsf{U}_{b}\subset\mathscr{U}_{b}\subset\mathscr{D}(A)$ . It follows from Theorem 2.5 that $\rho$ is a bijection from $\mathsf{U}_{b}$ onto $\mathsf{V}_{b}$ with inverse $\zeta$ , hence $\rho[z]=\nu$ if and only if $z(\mathrm{d}q)=\nu(\mathrm{d}q)T_{q}^{\circ}(\nu)$ . We insert the formula (3.25) from Lemma 3.9 for $T_{q}^{\circ}(\nu)$ and obtain (3.12).

Let $z\in\mathfrak{M}_{\mathbb{C}}$ satisfy ( $\mathsf{S}_{a,b}$ ) and $\mathrm{e}^{a}|z|\in\mathsf{V}_{b}$ , then $|A(q;z)|\leq a(q)$ by Lemma 3.7. By Definition 3.2, the density is given by $\rho(\mathrm{d}q;z)=z(\mathrm{d}q)\mathrm{e}^{-A(q;z)}$ which is bounded by $|\rho|(\mathrm{d}q;z)\leq|z|(\mathrm{d}q)\mathrm{e}^{a(q)}\in\mathsf{V}_{b}$ . As $\zeta[\rho[z]]=z$ in the sense of formal power series and $\zeta$ is a convergent on $\mathsf{V}_{b}$ , it remains to show that the composition is also convergent. For that we do not only need that $\rho[z]\in\mathsf{V}_{b}$ but also that all the interchanges are allowed, that is, an estimate in terms of $|z|$ and $|A_{n}|$ , namely ( $\mathscr{S}_{b}$ ). Therefore, we finally get $z\in\mathsf{U}_{b}$ .

As an equality of formal power series, Eq. (3.14) follows from the dissymmetry theorem for connected and biconnected graphs and power series manipulations similar to the proof of Lemma 3.8. Precisely, we have the following identity

[TABLE]

The proof of (3.27) is easily adapted from [JTTU14, Theorem 3.1] or [Ler04] and therefore omitted. The first part of condition (3.13) is condition (3.10) from Lemma 3.1, we have established (3.14) in the sense of formal power series. Next, we check absolute convergence of the power series associated with the terms in Eq. (3.27).

Let us consider (3.27) term by term starting from the left. Consider

[TABLE]

The first part of condition (3.13) is the same as condition (3.20) with $|z|$ instead of $\mu$ , so we may apply the bound (3.21) and get that

[TABLE]

Hence the formal power series for $\log\Xi(\beta,z)$ is converging exactly in the sense that (3.28) is finite.

Next, by (3.29) and condition (3.13), we also have

[TABLE]

which is the sense in which the power series for $\int_{\mathbb{X}}\rho(\mathrm{d}x_{1};z)$ converges.

Finally, define $\tilde{\nu}(\mathrm{d}q):=\mathcal{R}(q;|z|)\,|z|(\mathrm{d}q)$ which by (3.29) is bounded by $\tilde{\nu}\leq\mathrm{e}^{a+\beta B}|z|$ .

Now $\mathrm{e}^{a+\beta B}|z|$ is in $\mathsf{V}_{b}$ by the second condition in (3.13) and therefore $\tilde{\nu}$ is in $\mathsf{V}_{b}$ as well. Thus we can bound

[TABLE]

where in the third but last inequality we applied ( $\mathsf{M}_{b}$ ) with $\tilde{\nu}$ instead of $|\nu|$ . At the very end we have used again condition (3.13). This is the sense in which the third term converges. Note that the sense of convergence is strong enough, such that that also re-ordering of the terms is converging so long one does not break up $D_{m}$ . As a consequence, Eq. (3.14) holds true not only as an equality of formal power series but also as an equality of convergent sums. ∎

Proof of Theorem 3.6.

The standard line of reasoning is as follows: we check that the solution $z$ to the equation $\rho[z]=\nu$ —which exists by Theorem 3.5—is a maximizer in (3.15), deduce a formula for $\mathcal{F}_{\mathrm{G}C}[\nu]$ in terms of the maximizer $z$ , plug in (3.12) and (3.14), and obtain the statement. The full proof requires us to check that all steps are fully justified.

It is convenient to rewrite the definition (3.15) as

[TABLE]

where the supremum is taken over all measurable $h:\mathbb{X}\to\mathbb{R}\cup\{-\infty\}$ such that $\int_{\mathbb{X}}|h|\mathrm{d}\nu<\infty$ .

Let $\nu\in\mathsf{V}_{b}$ satisfy the assumptions of the theorem. By Theorem 3.5, the measure $z_{0}:=\zeta[\nu]$ satisfies $\rho[z_{0}]=\nu$ . Therefore, it is of the form $z_{0}(\mathrm{d}q)=\mathrm{e}^{h_{0}(q)}m(\mathrm{d}q)$ with

[TABLE]

We check that $h_{0}$ is a maximizer in (3.32). As a preliminary observation, we note that $|h_{0}(q)|\leq|\log\frac{\mathrm{d}\nu}{\mathrm{d}m}(q)|+b(q)$ using ( $\mathsf{M}_{b}$ ), therefore condition (3.16) yields $\int_{\mathbb{X}}|h_{0}|\mathrm{d}\nu<\infty$ . Thus $h_{0}$ does indeed belong to the set over which the supremum in (3.32) is taken.

Let $h:\mathbb{X}\to\mathbb{R}\cup\{-\infty\}$ be another function with $\int_{\mathbb{X}}|h|\mathrm{d}\nu<\infty$ . We need to check that

[TABLE]

By the last condition in (3.16), the measure $z_{0}=\mathrm{e}^{h_{0}}m$ satisfies condition (3.3) and so $\Xi[\mathrm{e}^{h_{0}}m]<\infty$ and the right-hand side in (3.33) is finite. If $\Xi[\mathrm{e}^{h}m]=\infty$ , then the inequality (3.33) holds trivially true. If $\Xi[\mathrm{e}^{h}m]<\infty$ , then the inequality (3.33) is equivalent to

[TABLE]

and it will be checked with the help of convexity. Set

[TABLE]

It is a well-known consequence of Hölder’s inequality that $g(t)$ is convex.

Next we check that the right derivative of $g$ at zero exists and is given by $g^{\prime}(0)=\int_{\mathbb{X}}(h-h_{0})\mathrm{d}\nu$ . We look at the derivative of $\exp(g(t))$ first. Set $h_{t}:=(1-t)h_{0}+th$ . We have

[TABLE]

To facilitate differentiation, we check that configurations with infinite $h_{t}(x_{i})$ ’s do not contribute. As $\int_{\mathbb{X}}e^{h}\mathrm{d}m\leq\Xi[\mathrm{e}^{h}m]<\infty$ we have $m$ -a.e. that $h<\infty$ . Furthermore, we can see that

[TABLE]

where we used in the first equality that $\mathrm{e}^{\sum_{i=1}^{n}h_{t}(x_{i})}{\mathchoice{1\mskip-4.0mu\mathrm{l}}{1\mskip-4.0mu\mathrm{l}}{1\mskip-4.5mu\mathrm{l}}{1\mskip-5.0mu\mathrm{l}}}_{\{\exists i:\,h(x_{i})=-\infty\}}=0$ and in the inequality that $\nu=\rho[z_{0}]$ . By choice of $h=h_{1}$ , the integral $\int_{\mathbb{X}}|h|\mathrm{d}\nu$ is finite, hence $h>-\infty$ , $\nu$ -almost everywhere. It follows that the last expression in (3.36) vanishes, hence also all preceding expressions in the chain of inequalities vanish. Therefore we have that $m$ -a.e. holds $|h(x)|<\infty$ . The same holds for $h_{0}$ . As $|h_{t}(x)|=\infty$ only if either $|h_{0}(x)|$ or $|h_{1}(x)|=|h(x_{i})|$ are infinite we have that

[TABLE]

has full $m$ -measure and hence $h_{t}$ is well-defined on $C$ . The considerations above yield

[TABLE]

for all $t\in[0,1]$ . We also have as $\nu=\rho[z_{0}]$ that

[TABLE]

and therefore it holds that

[TABLE]

Each integrand goes to zero as $t\to 0$ , we need a $t$ -independent integrable upper bound in order to apply dominated convergence. For $a,u\in\mathbb{R}$ and $t>0$ we have

[TABLE]

If $u>0$ , pick $\varepsilon\in(0,1)$ and assume $t\in(0,1-\varepsilon)$ . We apply the inequality $x\mathrm{e}^{-x}\leq\mathrm{e}^{-1}$ to $x=\varepsilon u$ and find that the upper bound is $u\exp(a+tu)\leq(\varepsilon\mathrm{e})^{-1}\exp(a+(t+\varepsilon)u)\leq(\varepsilon\mathrm{e})^{-1}u\exp(a+u)$ . Altogether we find

[TABLE]

This inequality applied to $\varepsilon=1/2$ , $a=\sum_{i}h_{0}(x_{i})$ and $u=\sum_{i}(h_{1}(x_{i})-h_{0}(x_{i}))$ yields, for $t\in(0,1/2)$ , that the integrand in (3.39) is bounded in absolute value by

[TABLE]

When one integrates over $x_{1},\ldots,x_{n}$ , multiply with $\frac{1}{n!}$ , sum over $n$ , one obtains

[TABLE]

Thus we may apply dominated convergence to (3.39) and find that indeed

[TABLE]

from which we deduce $g^{\prime}(0)=\int_{\mathbb{X}}(h_{1}-h_{0})\mathrm{d}\nu$ . We have already observed that $g(t)$ is convex and hence one has that $g(t)\geq g(0)+g^{\prime}(0)t$ , which for $t=1$ is precisely the inequality (3.34). It follows that $h_{0}$ is a maximizer in (3.33) and

[TABLE]

The final step is to insert the expression for $\log\Xi[\zeta[\nu]]$ from Eq. (3.14) in Theorem 3.5, keeping in mind that $\rho[\zeta[\nu]]=\nu$ . This then yields (3.17).

To justify the application of (3.14), we could in principle impose conditions on $\nu$ that guarantee that $z_{0}=\zeta[\nu]$ satisfies the condition (3.13) from Theorem 3.5, however this would result in more restrictive conditions and therefore we take a slightly different approach. We start from the formal power series identity

[TABLE]

which follows from (3.14) and $\rho[\zeta[\nu]]=\nu$ . It is justified, as a formal power series identity, without any conditions on $\nu$ . Additional arguments are needed to ensure that (3.41) holds true as an equality of convergent expressions. The exponential of the left-hand side of (3.41) is the formal power series

[TABLE]

see Eq. (A.8) in Appendix (A). The set $L$ is non-empty but $J_{\ell}=\varnothing$ is allowed (we agree $t_{0}=1$ ). We have

[TABLE]

with

[TABLE]

The term in parentheses is smaller or equal to $\exp(b(q))$ by our assumption $\nu\in\mathsf{V}_{b}$ and Lemma 3.7, therefore

[TABLE]

by the last assumption on $\nu$ in (3.16). It follows that $\mu$ satisfies the finite-volume condition (3.3), hence $\Xi(\mu)$ is finite and thus (3.43) is finite. It follows that (3.42) is equal to $\Xi[\zeta[\nu]]$ not just as a formal power series but as an equality of convergent series.

Similar considerations apply to the right-hand side of (3.41). It follows that (3.41) holds true as an equality of convergent series. We plug the expression for $\Xi[\zeta[\nu]]$ from (3.41) into the formula (3.40) and obtain the expression (3.17) for the free energy. ∎

4. Examples

4.1. Homogeneous gas

Consider a homogeneous gas of particles in a domain $\Lambda\subset\mathbb{R}^{d}$ , interacting via a translationally invariant pair potential $V(x,y)=v(x-y)$ , with $v(x)=v(-x)$ . The potential is assumed to be stable,

[TABLE]

for some $B\geq 0$ , all $N\geq 2$ , and all $x_{1},\ldots,x_{N}\in\mathbb{R}^{d}$ . Furthermore, we assume

[TABLE]

Further assume that $\inf v\geq-B^{*}$ for some $B^{*}\in(0,\infty)$ . Mayer’s irreducible cluster integrals are defined as

[TABLE]

which in terms of the coefficients $D_{n}$ from (3.11), can be expressed as

[TABLE]

The grand-canonical partition function $\Xi_{\Lambda}(\beta,z)$ at inverse temperature $\beta>0$ and activity $z>0$ is defined in the usual way, and the pressure is given by

[TABLE]

with the limit taken along van Hove sequences [Rue69]. Further set

[TABLE]

It is well-known [Rue69] that if $C(\beta)\mathrm{e}^{2\beta B}|z|\leq\frac{1}{\mathrm{e}}$ , then the limit (4.2) and the derivative (4.3) exist, moreover they define functions that are analytic in $C(\beta)\mathrm{e}^{2\beta B}|z|<\frac{1}{\mathrm{e}}$ (at least), we use the same letters for the analytic extensions to the complex disk. We fix $\beta>0$ and drop the $\beta$ -dependence from the notation in $p_{\beta}(z)$ and $\rho_{\beta}(z)$ .

Theorem 4.1.

(a)

If $\nu\in\mathbb{C}$ satisfies $\bar{C}(\beta)\mathrm{e}^{\beta[B+B^{*}]}|\nu|\leq\frac{1}{2\mathrm{e}}$ , then $\sum_{n=1}^{\infty}|\beta_{n}\nu^{n}|\leq\frac{1}{2}$ . In particular, the radius of convergence $R_{\mathrm{vir}}$ of $\sum_{n}\beta_{n}\nu^{n}$ is bounded from below by

[TABLE] 2. (b)

There exists some neighborhood $\mathcal{O}$ of the origin with

[TABLE]

such that $\rho(\cdot)$ is a bijection from $\mathcal{O}$ onto the open ball $B(0,R^{*})$ , with inverse

[TABLE] 3. (c)

For all $z\in\mathcal{O}$ , we have

[TABLE] 4. (d)

For all $\rho\in(0,R^{*})$ , the Helmholtz free energy $f(\rho):=\sup_{z>0}(\beta^{-1}\rho\log z-p(z))$ is given by

[TABLE]

Let us compare our result to what was known before. The bound (4.4) should be contrasted with the best known bound

[TABLE]

where

[TABLE]

(the lower bound is sharp, it is actually $k=\frac{(W(\mathrm{e}/2)-1)^{2}}{W(\mathrm{e}/2)}$ , cf. [Tat13]) and

[TABLE]

For non-negative pair potentials, we have $\bar{B}=0$ and (4.7) coincides with the lower bound proven by Lebowitz and Penrose [LP64], who also proved the lower bound in (4.8). For attractive pair potentials, the bound (4.7) is an improvement on the bound from [LP64], which was proven in [Pro17], where the constant $\bar{B}$ is called the Basuev stability constant. The constant $\bar{B}$ also enters an asymptotic upper bound to $R_{\mathrm{vir}}$ as $\beta\to\infty$ , see [Jan12, Theorem 2.8].

Let us compare our bound (4.4) with (4.7). It differs in two ways: it has a different constant $\frac{1}{2\mathrm{e}}$ and a different exponential $\exp(-\beta(B^{*}+B))$ . Our constant $\frac{1}{2\mathrm{e}}$ is better but for attractive interactions our exponential in general is worse. As a consequence, for non-negative interactions, our bound yields a considerable improvement over the bound from [LP64] and hence (4.7)

[TABLE]

therefore our bound improves substantially all known bound. The improvement subsists for attractive interactions with small $\beta$ . For large $\beta$ or strong interactions, the bound (4.7) due to [Pro17] trumps ours.

*Remark 4.2** (Attractive potentials).*

Additional work is needed to see whether our exponent $\exp(-\beta(B+B^{*}))$ in (4.4) can be replaced by the exponent $\exp(-\beta\bar{B})$ as in (4.7). This is related to the fact that bounding $b_{n}$ ’s in the Mayer expansion $\rho(z)=\sum_{n=1}^{\infty}nb_{n}z^{n}$ may sometimes be better than bounding $a_{n}$ in the representation $\rho(z)=z\exp(-\sum_{n=1}^{\infty}a_{n}z^{n})$ . Indeed, in our approach, the factor $\exp(-\beta B^{*})$ comes up in Lemma 3.7 where, in order to write $\rho(z)/z$ the density as an exponential $\exp(-A(z))$ and bound the exponent, we split the expansion of $A$ and we get an additional factor $\exp(\beta B^{*})$ in Eq. (3.18).

*Remark 4.3** (Relation with Lagrange inversion).*

After the proof of Theorem 4.1 we will explain how to recover our bound (4.4) in the case $B=0$ based on a slightly different treatment of the Lagrange inversion from [LP64], and where exactly our gain is achieved.

*Remark 4.4** (Further improvements for non-negative pair potentials).*

The factor $\frac{1}{2\mathrm{e}}$ could be further improved using our techniques combining them with the refined tree-graph inequality from [FPS07], i.e., working with trees where children communicate, resulting in additional constraints on trees. Instead of the generating function of the trees, one has to consider the solution of the equation

[TABLE]

where $\tilde{g}_{d}(k)$ are defined as in [FPS07], where it was used for the activity expansion. Doing so, for hard disks, that is $d=2$ , one obtains $0.196$ as a lower bound for the radius of convergence of the virial expansion instead of $\frac{1}{2\mathrm{e}}\approx 0.184$ .

Proof of Theorem 4.1.

We apply the considerations from Section 3 to the case $\mathbb{X}=\mathbb{R}^{d}$ , $\mathcal{X}$ the Borel sets, and specialize to translationally invariant measures $z(\mathrm{d}x)=z\mathrm{d}x$ with a constant scalar $z$ . For such a measure the measure $\rho(\mathrm{d}q;z)$ given by $\exp(-A(q;z))z(\mathrm{d}q)$ is translationally invariant as well, we write $\rho(\mathrm{d}q;z)=\rho(z)\mathrm{d}q$ and note that $\rho(z)$ is equal to the limit (4.3), moreover $\rho(z)=z\exp(-A(z))$ with

[TABLE]

Conversely, if $\nu(\mathrm{d}q)=\nu\mathrm{d}q$ is a translationally invariant measure, then the inverse $\zeta(\mathrm{d}q;\nu)$ from is translationally invariant as well.

By Theorem 3.5 applied to $\nu(\mathrm{d}x)=\nu\mathrm{d}x$ , constant functions $a$ and $b$ , if the number $\nu\in\mathbb{C}$ satisfies

[TABLE]

for some $a,b\geq 0$ with $a\leq b$ , then $\nu\in\mathsf{V}_{b}$

[TABLE]

(remember (4.1) and ( $\mathsf{M}_{b}$ )). Condition (4.10) is further evaluated as

[TABLE]

Therefore if $\bar{C}(\beta)\mathrm{e}^{\beta(B+B^{*})}|\nu|\leq\frac{1}{2\mathrm{e}}$ , then condition (4.10) holds true with $a=b=\frac{1}{2}$ and Eq. (4.11) holds true with $b=\frac{1}{2}$ . Part (a) of the theorem follows.

Part (b) follows from the first part of Theorem 3.5, with $b=1/2$ and $\mathsf{V}_{1/2}=B(0,R^{*})$ . Then $|\zeta[\nu]|=|\nu|\ \left|T^{\circ}(\nu)\right|\leq R^{*}e^{b}$ by Theorem 2.3. For $z\in\mathbb{C}$ there exists $a,b$ with $0\leq a\leq b$ such that $e^{a}|z|\in\mathsf{V}_{b}$ if and only if $C(\beta)\mathrm{e}^{\beta(B+B^{*})}|z|\leq\frac{1}{\mathrm{e}\mathrm{e}^{2/\mathrm{e}}}$ . Note that $e^{a}|z|\in\mathsf{V}_{b}$ means that ( $\mathsf{S}_{a,b}$ ) holds for an $0\leq\tilde{a}\leq b$ instead of $a$ and not necessarily $a=\tilde{a}$ .

For part (c), we note that the validity of (4.5) for sufficiently small $|z|$ is already known [LP64]. Alternatively, we may deduce from Theorem 3.5 by working first in finite volume and then taking the infinite-volume limit. This way of proceding guarantees the validity of (4.5) under the additional condition $\mathrm{e}^{a+\beta B}|z|<R^{*}$ for some $0\leq a\leq\frac{1}{2}$ . The additional condition is eliminated by invoking analyticity: The left and right sides of (4.5) define functions of $z$ that are analytic in $\mathcal{O}$ and coincide on some non-empty open ball, therefore they are equal on all of $\mathcal{O}$ .

Theorem 3.6, using that in part (a) we have shown $\rho\in\mathsf{V}_{b}$ whenever $0\leq|\rho|\leq R^{*}$ , gives part (d) of the theorem in finite volume. By part (a) the radius of convergence of the series in (4.6) is independent of the volume. Combining with the translation invariance we see that the right-hand side of (3.17), divided by the volume, converges to the right-hand side of (4.6) in the infinite-volume limit. Furthermore, that the free-energy is the Legendre transform of the pressure can be extended to the infinite volume as well. We note that the validity of (4.5) for sufficiently small $|z|$ was already known [LP64]. ∎

Let us provide an alternative derivation of the bound (4.4) for non-negative potentials ( $B=0$ ). The key point in [LP64] is a lower bound for the radius of convergence $R_{\mathrm{vir}}$ of the expansion in $\rho$ as

[TABLE]

which is derived in [LP64] using a Lagrange inversion /tk, where $R$ is the convergence radius of $\rho$ at zero. A lower bound for $R_{\mathrm{vir}}$ is then deduced from a lower bound for $|\rho(z)|$ . This is done in [LP64] (and also in [Tat13]) with the help of the triangle inequality $|\rho(z)|\geq|z|-|\rho(z)-z|$ . It turns out that if, instead, one uses the exponential structure $\rho(z)=z\mathrm{e}^{-A(z)}$ and an upper bound for $|A(z)|$ one can recover our bound (4.4) from (4.12). Let us explain the strategy. Our aim is to prove the following chain of inequalities

[TABLE]

where $T(z)=\sum_{n\geq 1}\frac{n^{n-1}}{n!}z^{n}$ is the generating function of labelled rooted trees (equivalently, $T(z)=-W(-z)$ with $W$ the Lambert function) and $R$ is the radius of convergence of the analytic function $A(z)$ .

The first inequality in Eq. (4.13), merely uses the idea $\rho(z)=z\mathrm{e}^{-A(z)}$ . The second inequality can be derived in several ways. It follows directly from Penrose’s tree-graph inequality, namely as in Lemma 3.7 use estimate (3.18) with $B^{*}=0$ , then by the tree-graph inequality $\mathcal{R}(q;z)\leq\frac{T(\bar{C}(\beta)|z|)}{\bar{C}(\beta)|z|}$ . Hence one gets $|A(z)|\leq T(\bar{C}(\beta)|z|)$ . Alternatively, following more the type of results used in this article, one gets from the inductive proof of Theorem 2.1 in [PU09], cf. Lemma 3.7 as well, that $|A(z)|\leq a$ whenever $\bar{C}(\beta)\mathrm{e}^{a}|z|\leq a$ . It remains to optimize over $a$ .Recall that $T(s)$ is converges on $[0,1/\mathrm{e}]$ with $T(1/\mathrm{e})=1$ and is also the branch of the real solution of the relation $T(s)=s\mathrm{e}^{T(s)}$ with $T(s)\leq 1$ . Hence, $T(s)$ satisfies, for $s\geq 0$ ,

[TABLE]

Since $T(s)$ diverges for $s>1/\mathrm{e}$ , Eq. (4.14) stays true for $s>1/\mathrm{e}$ if we interpret the infimum of the empty set as infinity. Equation (4.14) follows from the relation $T(s)=s\mathrm{e}^{T(s)}$ solved by $T$ , the bound $T(s)\leq T(1/e)=1$ and the the fact that $a\mapsto a\mathrm{e}^{-a}$ is strictly increasing on $[0,1]$ . Consequently, using (4.14) we get

[TABLE]

Hence the bound is finite and thus the radius of convergence of $A(z)$ is $R<\frac{1}{\mathrm{e}\bar{C}(\beta)}$ .

Finally, the third inequality can be derived using again (4.14) and $T(s)=s\mathrm{e}^{T(s)}$ , we have

[TABLE]

Setting $s=\bar{C}(\beta)r$ we deduce the final bound in (4.13), which is the same as (4.4) in the case of non-negative potential.

4.2. Inhomogeneous gas

Here we start from a homogeneous gas with fixed reference activity $z_{0}>0$ and then add an external potential $V_{\mathrm{ext}}(x)$ . The grand-canonical partition function in some bounded domain $\Lambda$ becomes

[TABLE]

and the density is given by

[TABLE]

Eq. (4.18) can be brought into the form from Section 3: let

[TABLE]

then

[TABLE]

similarly for the partition function. It follows from the tree-graph inequality in [PY17] that if

[TABLE]

for some $a:\mathbb{R}^{d}\to\mathbb{R}_{+}$ and all $x\in\mathbb{R}^{d}$ , then the limit

[TABLE]

exists and is given by the usual combinatorial formulas, with position-dependent activity $z(x)$ given in (4.19).

It is a classical problem to ask whether, given a density profile $\rho(x)$ , there exists a background potential $V_{\mathrm{ext}}(x)$ such that the density profile $\rho(x;V_{\mathrm{ext}})$ in the associated grand-canonical ensemble is equal to the given profile $\rho(x)$ . In view of (4.19), Theorem 3.5 has direct implications for this problem when activities converge. For results without cluster expansions, see [CCL84].

Theorem 4.5.

Fix $\beta,z_{0}>0$ and a pair potential $v(x-y)$ with stability constant $B$ and lower bound $\inf v\geq-B^{*}>-\infty$ . Let $\rho:\Lambda\to\mathbb{R}_{+}$ be a measurable function such that

[TABLE]

for all $x\in\mathbb{R}^{d}$ and some functions $a,b:\mathbb{R}^{d}\to\mathbb{R}_{+}$ with $a\leq b$ pointwise. Then there exists a unique (up to null sets) background potential $V_{\mathrm{ext}}:\Lambda\to\mathbb{R}\cup\{\infty\}$ that satisfies (4.21) and such that $\rho(q;V_{\mathrm{ext}})=\rho(q)$ for Lebesgue-almost all $q$ . It is given by

[TABLE]

with absolutely convergent integrals and sum.

A sufficient condition for (4.22) to hold true is that $\bar{C}(\beta)\mathrm{e}^{\beta B}||\rho||_{\infty}\leq\frac{1}{2\mathrm{e}}$ (pick $a=b\equiv\frac{1}{2}$ ). In fact one easily checks that, if we are interested in bounded density profiles only, we are in the situation where a direct application of the Banach inversion theorem (Theorem 2.10) is possible.

Proof.

The absolute convergence of the series in (4.23) follows right away from Theorem 3.4 applied to $\nu(\mathrm{d}x)=\rho(x)\mathrm{d}x$ . By Theorem 3.5, there is a unique measure $z(\mathrm{d}q)$ in the domain of convergence $\mathscr{D}(A)$ such that $\nu(\mathrm{d}q)=\rho(\mathrm{d}q;z)$ , with $\rho(\mathrm{d}q;z)$ the density at activity $z(\mathrm{d}x)$ for the interaction potential $v(x-y)$ . Moreover the activity is given by Eq. (3.12), which after plugging in $\nu(\mathrm{d}q)=\rho(q)\mathrm{d}q$ becomes $z(\mathrm{d}q)=z(q)\mathrm{d}q$ with

[TABLE]

We adopt (4.19) as a definition of the external potential, then $\beta V_{\mathrm{ext}}(q)=\log z_{0}-\log z(q)$ and $V_{\mathrm{ext}}(q)$ is given by (4.23). It satisfies $\rho(q;V_{\mathrm{ext}})=\rho(q)$ by the definition (4.24) of $z(q)$ and $V_{\mathrm{ext}}$ . Condition (4.21) follows rom ( $\mathsf{M}_{b}$ ) as then $|z(q)|\leq|\rho(q)|e^{b(q)}$ and thus (4.22) implies (4.21). ∎

4.3. Mixture of hard spheres

Consider a mixture of hard spheres with radii $R_{1},R_{2},\ldots$ , for example, $R_{k}=k^{1/d}$ . The activity $z_{k}$ of the sphere depends on the type $k$ but otherwise the system is homogeneous. To bring the model into the form from Section 3, let $\mathbb{X}=\mathbb{R}^{d}\times\mathbb{N}$ , with $(x,k)$ representing a sphere of radius $R_{k}$ centered at $x$ . We consider measures $z$ informally given by $z=\oplus_{k\in\mathbb{N}}z_{k}\mathrm{d}x$ . More precisely, $\int_{\mathbb{X}}h\mathrm{d}z=\sum_{k=1}^{\infty}\int_{\mathbb{R}^{d}}h(x,k)z_{k}\mathrm{d}x$ for every non-negative test function $h$ . The interaction is hard core exclusion

[TABLE]

Let $p((z_{k})_{k\in\mathbb{N}})$ be the infinite-volume pressure and $\rho_{k}((z_{j})_{j\in\mathbb{N}}):=z_{k}\frac{\partial p}{\partial z_{k}}((z_{j})_{j\in\mathbb{N}})$ . A sufficient condition for the convergence of the activity expansion of the pressure is

[TABLE]

for some non-negative sequence $(a_{j})_{j\in\mathbb{N}}$ of positive numbers and all $k\in\mathbb{N}$ , as is easily checked from [Uel04].

Theorem 4.6.

Suppose that $(\rho_{k})_{k\in\mathbb{N}}\in\mathbb{C}^{\mathbb{N}}$ satisfies

[TABLE]

for all $k\in\mathbb{N}$ and two sequences $(a_{j})$ , $(b_{j})$ with $b_{j}\geq a_{j}\geq 0$ for all $j\in\mathbb{N}$ . Then there exists a unique sequence $(z_{k})_{k\in\mathbb{N}}$ with $\rho_{j}((z_{k})_{k\in\mathbb{N}})=\rho_{j}$ for all $j\in\mathbb{N}$ and such that condition (4.25) holds. It is given by

[TABLE]

The coefficients $D_{n}$ are given by sums over $2$ -connected graphs as in (3.11). The sum in the exponential in (4.27), with absolute values inside the integral, is bounded by $b_{k}$ .

The theorem is deduced from Theorems 3.4 and 3.5, the details are left to the reader.

4.4. Flexible molecules. Liquid crystals

Finally we come to a system of objects with internal degrees of freedom: we assume that the space $\mathbb{X}$ is of the form $\mathbb{X}=\Lambda\times S$ with $\Lambda\subset\mathbb{R}^{d}$ a bounded domain.333We could also allow for spaces $\mathbb{X}=\sqcup_{k\in\mathbb{N}}(\Lambda\times S_{k})$ representing a multi-species system where each species $k$ has its own spin space $S_{k}$ , but for simplicity we stick to the single-species case. The space $S$ represents internal degrees of freedom (spin, orientation, shape of a molecule…). For example, we could take $S$ as the projective space $\mathbb{P}^{d-1}$ (i.e., $\mathbb{R}^{d}\setminus\{0\}$ with identification of parallel vectors) and think of $(x,\vec{u})$ as a thin rod centered at $x$ with orientiation $\vec{u}$ . Such a model is often used for the study of liquid crystals [Ons49].

Suppose we are given a reference measure $m$ on $\mathbb{X}$ that is of the form $m(\mathrm{d}(x,\sigma))=\mathrm{d}x\,\lambda(\mathrm{d}\sigma)$ , i.e., it is the product of the Lebesgue measure on $\Lambda$ and a reference measure $\lambda$ on $S$ (e.g. a uniform measure on orientations of thin rods). To simplify formulas, we write $\mathrm{d}\sigma$ instead of $\lambda(\mathrm{d}\sigma)$ . The pair potential $V((x,\sigma),(y,\tau))$ is a function of both position and internal degree of freedom.

Following Onsager, one could work in a multi-species canonical ensemble, where each species represents a discretized orientation. In such a setup, deriving the canonical free energy is immediate following [PT12]. One can easily derive a functional for continuous orientations, using our techniques presented here, that is, to start in the grand-canonical ensemble, and obtain the grand-canonical free energy via Legendre transform and inversion of the density-activity relation, which is precisely the definition (3.15) for $\mathcal{F}_{\mathrm{GC}}[\nu]$ . Let us write $\nu(\mathrm{d}(x,\sigma))=\rho(x,\sigma)\mathrm{d}x\mathrm{d}\sigma$ and, by a slight abuse of language, $\mathcal{F}_{\mathrm{GC}}[\rho]$ instead of $\mathcal{F}_{\mathrm{GC}}[\nu]$ .

For simplicity we prove results for non-negative pair potentials $V$ only but note that our general theorems lead just as easily to stable pair potential.

Theorem 4.7.

Let $V\geq 0$ and $\rho:\mathbb{X}\to\mathbb{R}_{+}$ . Suppose there exist weight functions $a,b:\mathbb{X}\to\mathbb{R}_{+}$ with $b\geq a$ . Suppose that $\rho:\Lambda\times S\to\mathbb{R}_{+}$ satisfies

[TABLE]

for all $(x,\sigma)\in\Lambda\times S$ , and

[TABLE]

Then

[TABLE]

with absolutely convergent integrals and sum.

Proof.

The theorem is an immediate consequence of Theorem 3.6. ∎

When we think of rods with an orientiation, we may specialize to situations where there is translational invariance but not necessarily rotational invariance:

Corollary 4.8.

Assume that $\rho(x,\sigma)=\rho_{0}p(\sigma)$ for some scalar $\rho_{0}>0$ and non-negative $p:S\to\mathbb{R}_{+}$ with $\int_{S}p(\sigma)\mathrm{d}\sigma=1$ . Assume that $|\Lambda|<\infty$ , $\int_{S}p(\sigma)|\log p(\sigma)|\,\mathrm{d}\sigma<\infty$ , and

[TABLE]

Then

[TABLE]

with absolutely convergent integral and series.

If $V$ is translation invariant, then the right-hand side of (4.28) is proportional to the volume, up to boundary errors that become irrelevant in the thermodynamic limit, and the corollary also yields an expression for the thermodynamic limit $\lim\frac{1}{|\Lambda|}\beta\mathcal{F}_{\Lambda}[\rho]$ .

The right-hand side of (4.28) corresponds to the functional from Eq. (27) in [Ons49], which is the free energy functional derived by Onsager before applying additional approximations due to thinness of rods etc.

*Remark 4.9**.*

In [JTTU14], in order to obtain $2$ -connected coefficients for the case of molecules with internal degrees of freedom, we needed to assume rigidity of the molecules so that Lemma 4.1 in [JTTU14] about factorization of graph weights holds true. In the present article, as seen in Corollary 4.8, we obtain the $2$ -connected coefficients as well provided we keep the probability density $p(\sigma)$ of shapes as an explicit variable. If instead we look at

[TABLE]

expand the minimizer $p(\sigma;\rho_{0})$ in powers of $\rho_{0}$ and compose with the expansion of $\frac{1}{|\Lambda|}\mathcal{F}_{\Lambda}[\rho_{0}p]$ , we see that the coefficient of $\rho_{0}^{n}$ in the expansion of $f_{\Lambda}(\rho_{0})$ is not given by $D_{n}$ .

Appendix A Formal power series and Ruelle’s algebraic formalism

Here we summarize some facts on the formal power series used in this article, and point out the relation with Ruelle’s algebraic formalism. We are interested in power series and formal power series of the form

[TABLE]

where $(\mathbb{X},\mathcal{X})$ is a measurable space $z$ is a measure on $(\mathbb{X},\mathcal{X})$ , and $K_{0}\in\mathbb{C}$ is a scalar, and $K_{n}:\mathbb{X}^{n}\to\mathbb{C}$ are measurable maps that are invariant under permutation of the arguments.

In general, for a formal power series, the integrals and the series need not to converge, hence, in analogy with the theory of formal power series of a single variable, we define a formal power series as a sequence $(K_{n})_{n\in\mathbb{N}}$ of symmetric functions and downgrade (A.1) to a mnemonic notation. Standard operations such as sums and products are defined directly as operations on the sequences $(K_{n})_{n\in\mathbb{N}_{0}}$ in such a way that for two sufficiently well convergent power series one obtains the same result. The sum of two formal power series $K+G$ is the formal series with coefficients $(K_{n}+G_{n})_{n\in\mathbb{N}_{0}}$ , for $\lambda\in\mathbb{C}$ the formal series $\lambda K$ is the series with coefficients $(\lambda K_{n})_{n\in\mathbb{N}_{0}}$ . Other operations are defined below. The resulting algebra of formal power series is exactly the algebra of symmetric functions introduced by Ruelle [Rue69, Chapter 4.4].

Product. Let $K,G$ be formal power series, then $KG$ is defined by

[TABLE]

The empty set $J=\varnothing$ is explicitly allowed. As an operation on sequences of symmetric functions, this is exactly the convolution in [Rue69, Chapter 4.4]. It is not difficult to check that the product is commutative and associative. Eq. (A.2) generalizes to products $K^{(1)}\cdots K^{(r)}$ as

[TABLE]

where the sum runs over ordered partitions $(V_{1},\ldots,V_{r})$ of $[n]$ into $r$ disjoint parts, with $V_{i}=\varnothing$ explicitly allowed.

The definition (A.2) is motivated by the following computation, which is valid if the power series are absolutely convergent: From

[TABLE]

we get

[TABLE]

The summand for $m=\ell=0$ should be read as $K_{0}G_{0}$ . The binomial coefficient $\binom{n}{m}$ is equal to the number of subsets $J\subset[n]$ of cardinality $\#J=m$ . The value of the integral

[TABLE]

depends on the cardinality $m$ of $J$ alone, and so we find that

[TABLE]

with $(KG)_{n}$ defined in (A.2).

Variational derivative. For $q\in\mathbb{X}$ and $K$ a formal power series over $\mathbb{X}$ , we define

[TABLE]

In the language of [Rue69, Chapter 4.4], $\frac{\delta}{\delta z(q)}$ corresponds to the derivation $D_{q}$ . Formally,

[TABLE]

and

[TABLE]

as it should be.

Composition I and exponential series. Let $F(t)=\sum_{n=0}^{\infty}f_{n}t^{n}/n!$ be a formal power series in a single variable $t$ and $K$ a formal power series on $(\mathbb{X},\mathcal{X})$ with $K_{0}=0$ . The formal power series $F\circ K$ on $\mathbb{X}$ is defined by $(F\circ K)_{0}:=f_{0}$ and for $n\geq 1$ ,

[TABLE]

with $\mathcal{P}_{n}$ the collection of set partitions of $\{1,\ldots,n\}$ . Note that only because $K_{0}=0$ the expression (A.6) is well-defined, because only in this case the sum is finite. Formally,

[TABLE]

In the second line we have used (A.3). Because of $K_{0}=0$ , the only relevant contributions in the last line are from non-empty $J_{r}$ ’s. The factor $1/m!$ can be removed if we decide to sum over non-ordered partitions $\{J_{1},\ldots,J_{m}\}$ instead of ordered partitions $(J_{1},\ldots,J_{r})$ , and we arrive at the expression (A.6) for the coefficients of $F(K(z))$ .

An important special case is $F(t)=\exp(t)$ , for which Eq. (A.6) becomes

[TABLE]

which is exactly the exponential on the algebra of symmetric functions from [Rue69, Chapter 4.4].

Composition II. In the proof of Lemma 2.1 we need a more general type of composition, namely let $K$ be a formal power series on $\mathbb{X}$ with $K_{0}=0$ and $(G(q;z))_{q\in\mathbb{X}}$ a family of power series

[TABLE]

If $G(q;z)$ is absolutely convergent for each $q$ , define

[TABLE]

If sums and integrals are absolutely convergent, then

[TABLE]

where the $V_{i}$ can be empty. We group pairs $(m,r)$ with a common sum $m+r=n$ . For the factorials we note

[TABLE]

Exploiting the symmetry of the functions $K_{m}(\cdot)$ and $G_{j}(x;\cdot)$ , we find that the coefficients of $F$ are given by

[TABLE]

Appendix B Holomorphic functions on Banach spaces

Here we collect some fact that are useful for the Banach inversion. We refer the reader to [Har03, Muj06] for accessible surveys and [Din99, Muj86] for details. Let $E$ and $F$ be two complex Banach spaces. A multilinear map $A:E^{m}\to F$ is bounded if

[TABLE]

Definition B.1 (Homogeneous polynomials and power series).

(1)

A mapping $P:E\to F$ is a continuous $m$ -homogeneous polynomial if there exists a bounded multilinear map $A:E^{m}\to F$ such that $P(x)=A(x,\ldots,x)$ . 2. (2)

A power series from $E$ into $F$ is a series of the form $\sum_{m=0}^{\infty}P_{m}(x-a)$ , with $a\in E$ and $P_{m}$ a continuous $m$ -homogeneous polynomial. The radius of uniform convergence of the series is the supremum over all $r>0$ such that the series converges uniformly on $\{x\in E\mid||x-a||\leq r\}$ .

The norm of a continuous $m$ -homogeneous polynomial $P$ is

[TABLE]

For example, if $E=F=\mathbb{C}$ and $P(z)=a_{m}x^{m}$ , then $||P||=|a_{m}|$ .

Proposition B.2 (Cauchy-Hadamard formula).

[Muj06, Prop. 6]** The radius of uniform convergence of the power series $\sum_{m=0}^{\infty}P_{m}(x-a)$ satisfies

[TABLE]

Theorem B.3.

[Muj06, Theorem 7]** Let $U\subset E$ be a non-empty open subset and $f:U\to F$ . The following conditions are equivalent:

(1)

For each $a\in U$ , the Fréchet derivative of $f$ at $a$ exists: i.e., there exists a bounded linear map $A:E\to F$ such that

[TABLE] 2. (2)

For each $a\in U$ , there exists a power series $\sum_{m=0}^{\infty}P_{m}(x-a)$ that converges to $f(x)$ uniformly on some ball $B(a,r)\subset U$ (with $r>0$ ). 3. (3)

$f$ * is continuous in $U$ and, for each $a\in U$ , all elements $\psi$ of the dual Banach space $E^{\prime}$ , and all $b\in E$ , the map $\lambda\to\psi(f(a+\lambda b))$ is holomorphic in the usual sense in the open set $\{\lambda\in\mathbb{C}\mid a+\lambda b\in U\}$ .*

Definition B.4.

A mapping $f:U\to F$ is called holomorphic if it satisfies one (hence, all three) of the conditions (1)-(3) in Theorem B.3.

Many theorems for holomorphic functions in $\mathbb{C}$ have analogues (for example, Cauchy integral formulas), but there are a few pitfalls. For example, it is not true that the Taylor series of a function holomorphic on all of $E$ has infinite uniform radius of convergence. Also, it is not true that a holomorphic function is bounded on balls that are bounded away from $\partial U$ .

*Example B.5**.*

[Har03, Example 2.6] Let $c_{0}(\mathbb{N})$ be the Banach space of complex-valued sequences that converge to zero, equipped with the usual supremum norm. Define $f:c_{0}(\mathbb{N})\to\mathbb{C}$ by

[TABLE]

Then $f$ is holomorphic on all of $c_{0}(\mathbb{N})$ , but the radius of uniform convergence (in the sense of Definition B.1) of the series is $1$ , and for every $r>1$ , the function $f$ is unbounded on the ball $\{z\in c_{0}(\mathbb{N})\mid\sup_{n\in\mathbb{N}}|z_{n}|\leq r\}$ .

We conclude with a quantitative inverse function theorem. Let $U$ and open subset of $E$ and $h:U\rightarrow F$ . Call $V:=h(U)$ . An inverse function theorem give condition under which there exist open neighborhoods $U^{\prime}\subset U$ of [math] and $V^{\prime}\subset V$ of $h(0)$ , respectively, such that $h:U^{\prime}\to V^{\prime}$ is bijection with holomorphic inverse. An quantitative inverse function theorem additionally singles out numbers $r>0$ and $P>0$ , which only depends on $U,V,\|Dh(0)^{-1}\|$ , for which we may choose $U^{\prime}=B_{r}(0)$ and $V^{\prime}=h(U^{\prime})\supset B_{P}(0)$ . Alternatively, one can have quantitative inversion theorem such that $V^{\prime}=B_{P}(0)$ and $U^{\prime}=h^{-1}(B_{P}(0))\subset B_{r}(0)$ . Such numbers $r$ and $P$ are sometimes called Bloch radii after Bloch’s theorem. In the following theorem $E=F$ .

Theorem B.6.

[Har77, Proposition 2]** Let $B_{R}(0)$ and $B_{M}(0)$ be open balls in some complex Banach space $E=F$ and $h:B_{R}(0)\rightarrow B_{M}(0)$ a holomorphic function. Suppose that the derivative $\mathrm{D}h(0)$ at the origin is invertible with bounded inverse $||\mathrm{D}h(0)^{-1}||^{-1}\geq a>0$ . Let

[TABLE]

Then $h$ maps $B_{r}(0)$ biholomorphically onto a domain covering $B_{P}(h(0))$ .

Acknowledgments

The main part of this article was completed when the first and third authors were members of the Department of Mathematics at the University of Sussex and the second was frequently visiting; the authors acknowledge the department for the nice atmosphere. S. J. thanks the GSSI and T.K. the university in L’Aquila, Italy, for hospitality and M. Lewin for pointing out possible connections with the setting of the Nash-Moser theorem.

Bibliography35

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[BLL 98] F. Bergeron, G. Labelle, and P. Leroux, Combinatorial species and tree-like structures , Encyclopedia of mathematics and its applications, vol. 67, Cambridge University Press, 1998.
2[CCL 84] J. T. Chayes, L. Chayes, and E. H. Lieb, The inverse problem in classical statistical mechanics , Comm. Math. Phys. 93 (1984), no. 1, 57–121.
3[Din 99] S. Dineen, Complex analysis on infinite dimensional spaces , Springer Monographs in Mathematics, Springer, 1999.
4[Far 12] W. G. Faris, Biconnected graphs and the multivariate virial expansion , Markov Processes and Related Fields 18 (2012), no. 3, 357–386.
5[FPS 07] R. Fernández, A. Procacci, and B. Scoppola, The analyticity region of the hard sphere gas. Improved bounds , J. Stat. Phys. 128 (2007), no. 5, 1139–1143.
6[Ges 87] I. M. Gessel, A combinatorial proof of the multivariable Lagrange inversion formula , J. Combin. Theory Ser. A 45 (1987), no. 2, 178–195.
7[Ham 82] R. S. Hamilton, The inverse function theorem of Nash and Moser , Bull. Amer. Math. Soc. 7 (1982), no. 1, 65–122.
8[Har 77] L. A. Harris, On the size of balls covered by analytic transformations , Monatshefte für Mathematik 83 (1977), no. 1, 9–23.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Virial inversion and density functionals

Abstract.

Contents

1. Introduction

2. General inversion theorems

2.1. Main inversion theorem with proof

Lemma 2.1**.**

Proof.

Remark 2.2*.*

Theorem 2.3**.**

Proof.

Remark 2.4*.*

Theorem 2.5**.**

Proof.

Proposition 2.6**.**

Proof.

2.2. Scale of Banach spaces. Banach inversion

Example 2.7*.*

Proposition 2.8**.**

Corollary 2.9**.**

Theorem 2.10** (Banach inversion).**

Proof.

Remark 2.11*.*

Proof of P′=8PP^{\prime}=8PP′=8P.

Proof of Proposition 2.8.

2.3. An equivalent fixed point equation

Lemma 2.12**.**

Proof.

3. Virial expansion. Density functional

Lemma 3.1**.**

Definition 3.2**.**

Remark 3.3* (Physical interpretation of A(q;z)A(q;z)A(q;z)).*

Theorem 3.4**.**

Theorem 3.5**.**

Theorem 3.6**.**

Lemma 3.7**.**

Proof.

Lemma 3.8**.**

Proof.

Lemma 3.9**.**

Proof.

Proof of Theorem 3.4.

Proof of Theorem 3.5.

Proof of Theorem 3.6.

4. Examples

4.1. Homogeneous gas

Theorem 4.1**.**

Remark 4.2* (Attractive potentials).*

Remark 4.3* (Relation with Lagrange inversion).*

Remark 4.4* (Further improvements for non-negative pair potentials).*

Proof of Theorem 4.1.

4.2. Inhomogeneous gas

Theorem 4.5**.**

Proof.

4.3. Mixture of hard spheres

Theorem 4.6**.**

4.4. Flexible molecules. Liquid crystals

Theorem 4.7**.**

Proof.

Corollary 4.8**.**

Remark 4.9*.*

Appendix A Formal power series and Ruelle’s algebraic formalism

Appendix B Holomorphic functions on Banach spaces

Definition B.1** (Homogeneous polynomials and power series).**

Proposition B.2** (Cauchy-Hadamard formula).**

Theorem B.3**.**

Definition B.4**.**

Example B.5*.*

Theorem B.6**.**

Acknowledgments

Lemma 2.1.

*Remark 2.2**.*

Theorem 2.3.

*Remark 2.4**.*

Theorem 2.5.

Proposition 2.6.

*Example 2.7**.*

Proposition 2.8.

Corollary 2.9.

Theorem 2.10 (Banach inversion).

*Remark 2.11**.*

Proof of $P^{\prime}=8P$ .

Lemma 2.12.

Lemma 3.1.

Definition 3.2.

*Remark 3.3** (Physical interpretation of $A(q;z)$ ).*

Theorem 3.4.

Theorem 3.5.

Theorem 3.6.

Lemma 3.7.

Lemma 3.8.

Lemma 3.9.

Theorem 4.1.

*Remark 4.2** (Attractive potentials).*

*Remark 4.3** (Relation with Lagrange inversion).*

*Remark 4.4** (Further improvements for non-negative pair potentials).*

Theorem 4.5.

Theorem 4.6.

Theorem 4.7.

Corollary 4.8.

*Remark 4.9**.*

Definition B.1 (Homogeneous polynomials and power series).

Proposition B.2 (Cauchy-Hadamard formula).

Theorem B.3.

Definition B.4.

*Example B.5**.*

Theorem B.6.