Representation of Complex Probabilities

L.L. Salcedo

arXiv:hep-lat/9607044·hep-lat·October 28, 2009

Representation of Complex Probabilities

L.L. Salcedo

PDF

TL;DR

This paper demonstrates that every complex probability distribution can be represented by a real, positive distribution in a higher-dimensional space, providing explicit constructions for various classes of complex distributions.

Contribution

It introduces a constructive method to find positive real representations for any complex probability distribution, extending the applicability of probabilistic modeling.

Findings

01

Every complex probability admits a real positive representation.

02

Explicit representations are provided for Gaussian times polynomial distributions.

03

Constructive methods work in any number of dimensions.

Abstract

Let a ``complex probability'' be a normalizable complex distribution $P (x)$ defined on $R^{D}$ . A real and positive probability distribution $p (z)$ , defined on the complex plane $\C^{D}$ , is said to be a positive representation of $P (x)$ if $⟨ Q (x) ⟩_{P} = ⟨ Q (z) ⟩_{p}$ , where $Q (x)$ is any polynomial in $R^{D}$ and $Q (z)$ its analytical extension on $\C^{D}$ . In this paper it is shown that every complex probability admits a real representation and a constructive method is given. Among other results, explicit positive representations, in any number of dimensions, are given for any complex distribution of the form Gaussian times polynomial, for any complex distributions with support at one point and for any periodic Gaussian times polynomial.

Equations140

⟨ O (x) ⟩_{P} = \frac{⟨ O ( x ) F ( x ) ⟩ _{P_{0}}}{⟨ F ( x ) ⟩ _{P_{0}}} .

⟨ O (x) ⟩_{P} = \frac{⟨ O ( x ) F ( x ) ⟩ _{P_{0}}}{⟨ F ( x ) ⟩ _{P_{0}}} .

⟨ Q (x) ⟩_{P} = \frac{\int Q ( x ) P ( x ) \mbox d ^{D} x}{\int P ( x ) \mbox d ^{D} x} .

⟨ Q (x) ⟩_{P} = \frac{\int Q ( x ) P ( x ) \mbox d ^{D} x}{\int P ( x ) \mbox d ^{D} x} .

⟨ q (z) ⟩_{p} = \frac{\int q ( z ) p ( z ) \mbox d ^{2 D} z}{\int p ( z ) \mbox d ^{2 D} z} .

⟨ q (z) ⟩_{p} = \frac{\int q ( z ) p ( z ) \mbox d ^{2 D} z}{\int p ( z ) \mbox d ^{2 D} z} .

∣ det (A) ∣^{2}

∣ det (A) ∣^{2}

\int Q (z) \partial_{k}^{*} ϕ (z) \mbox d^{2 D} z = \int \partial_{k}^{*} (Q (z) ϕ (z)) \mbox d^{2 D} z = 0,

\int Q (z) \partial_{k}^{*} ϕ (z) \mbox d^{2 D} z = \int \partial_{k}^{*} (Q (z) ϕ (z)) \mbox d^{2 D} z = 0,

\int Q (z) \partial_{k} p (z) \mbox d^{2 D} z

\int Q (z) \partial_{k} p (z) \mbox d^{2 D} z

\int Q (z) R (z) p (z) \mbox d^{2 D} z

⟨ z_{i_{1}} \dots z_{i_{n}} ⟩_{p_{1} * p_{2}}

⟨ z_{i_{1}} \dots z_{i_{n}} ⟩_{p_{1} * p_{2}}

=

\int z_{i_{1}} \dots z_{i_{n}} C (z) \mbox d^{2 D} z = δ_{n, 0},

\int z_{i_{1}} \dots z_{i_{n}} C (z) \mbox d^{2 D} z = δ_{n, 0},

P (x) = \int p (x - i y, y) \mbox d^{D} y .

P (x) = \int p (x - i y, y) \mbox d^{D} y .

P (x) = \tilde{f} (x) exp (- \frac{1}{2} m_{ij} x_{i} x_{j})

P (x) = \tilde{f} (x) exp (- \frac{1}{2} m_{ij} x_{i} x_{j})

p (z) = det (m) f (m y) exp (- \frac{1}{2} m_{ij} z_{i} z_{j}^{*}),

p (z) = det (m) f (m y) exp (- \frac{1}{2} m_{ij} z_{i} z_{j}^{*}),

\tilde{P} (k) = \int e^{ik x} P (x) \mbox d^{D} x

\tilde{P} (k) = \int e^{ik x} P (x) \mbox d^{D} x

\tilde{P} (k) = n = 0 \sum \infty \frac{i ^{n}}{n !} k_{i_{1}} \dots k_{i_{n}} ⟨ x_{i_{1}} \dots x_{i_{n}} ⟩_{P}

\tilde{P} (k) = n = 0 \sum \infty \frac{i ^{n}}{n !} k_{i_{1}} \dots k_{i_{n}} ⟨ x_{i_{1}} \dots x_{i_{n}} ⟩_{P}

\langle x_{i_{1}}\cdots x_{i_{n}}\rangle_{P}=(-i)^{n}\partial_{i_{1}}\cdots\partial_{i_{n}}{\tilde{P}}(k)\big{|}_{k=0}\,.

\langle x_{i_{1}}\cdots x_{i_{n}}\rangle_{P}=(-i)^{n}\partial_{i_{1}}\cdots\partial_{i_{n}}{\tilde{P}}(k)\big{|}_{k=0}\,.

\tilde{p} (σ) = \int e^{ik x + i r y} p (z) \mbox d^{2 D} z = \int e^{i (σ^{*} z + σ z^{*}) /2} p (z) \mbox d^{2 D} z,

\tilde{p} (σ) = \int e^{ik x + i r y} p (z) \mbox d^{2 D} z = \int e^{i (σ^{*} z + σ z^{*}) /2} p (z) \mbox d^{2 D} z,

\langle z_{i_{1}}\cdots z_{i_{n}}z^{*}_{j_{1}}\cdots z^{*}_{j_{m}}\rangle_{p}=(-2i)^{n+m}\partial^{*}_{i_{1}}\cdots\partial^{*}_{i_{n}}\partial_{j_{1}}\cdots\partial_{j_{m}}{\tilde{p}}(\sigma)\big{|}_{\sigma=0}

\langle z_{i_{1}}\cdots z_{i_{n}}z^{*}_{j_{1}}\cdots z^{*}_{j_{m}}\rangle_{p}=(-2i)^{n+m}\partial^{*}_{i_{1}}\cdots\partial^{*}_{i_{n}}\partial_{j_{1}}\cdots\partial_{j_{m}}{\tilde{p}}(\sigma)\big{|}_{\sigma=0}

\tilde{p} (σ) = \tilde{C} (σ) \tilde{P} (\frac{σ ^{*}}{2}) (\tilde{P} (- \frac{σ ^{*}}{2}))^{*} .

\tilde{p} (σ) = \tilde{C} (σ) \tilde{P} (\frac{σ ^{*}}{2}) (\tilde{P} (- \frac{σ ^{*}}{2}))^{*} .

⟨ z_{i_{1}} \dots z_{i_{n}} ⟩_{p}

⟨ z_{i_{1}} \dots z_{i_{n}} ⟩_{p}

=

P (x) = δ (x) + a δ^{'} (x), a = a_{R} + i a_{I} \in \mbox C .

P (x) = δ (x) + a δ^{'} (x), a = a_{R} + i a_{I} \in \mbox C .

\tilde{p} (σ) = 1 - \frac{1}{4} ∣ a ∣^{2} ∣ σ ∣^{2} - \frac{i}{2} (a^{*} σ + a σ^{*})

\tilde{p} (σ) = 1 - \frac{1}{4} ∣ a ∣^{2} ∣ σ ∣^{2} - \frac{i}{2} (a^{*} σ + a σ^{*})

p (z) = δ (x) δ (y) + a_{R} δ^{'} (x) δ (y) + a_{I} δ (x) δ^{'} (y) + \frac{1}{4} ∣ a ∣^{2} (δ^{''} (x) δ (y) + δ (x) δ^{''} (y)) .

p (z) = δ (x) δ (y) + a_{R} δ^{'} (x) δ (y) + a_{I} δ (x) δ^{'} (y) + \frac{1}{4} ∣ a ∣^{2} (δ^{''} (x) δ (y) + δ (x) δ^{''} (y)) .

p (z) = \int C_{0} (x - \frac{x _{1} + x _{2}}{2}, y - \frac{x _{1} - x _{2}}{2 i}) P (x_{1}) P^{*} (x_{2}) \mbox d^{D} x_{1} \mbox d^{D} x_{2},

p (z) = \int C_{0} (x - \frac{x _{1} + x _{2}}{2}, y - \frac{x _{1} - x _{2}}{2 i}) P (x_{1}) P^{*} (x_{2}) \mbox d^{D} x_{1} \mbox d^{D} x_{2},

P (x) = i = 1 \sum N a_{i} δ (x - x^{(i)}), C (z) = exp (- \frac{z _{j} z _{j}^{*}}{2Γ}),

P (x) = i = 1 \sum N a_{i} δ (x - x^{(i)}), C (z) = exp (- \frac{z _{j} z _{j}^{*}}{2Γ}),

p (z) = i, j = 1 \sum N a_{i} a_{j}^{*} exp (- \frac{1}{2Γ} ((x - \frac{x ^{(i)} + x ^{(j)}}{2})^{2} + (y - \frac{x ^{(i)} - x ^{(j)}}{2 i})^{2})) .

p (z) = i, j = 1 \sum N a_{i} a_{j}^{*} exp (- \frac{1}{2Γ} ((x - \frac{x ^{(i)} + x ^{(j)}}{2})^{2} + (y - \frac{x ^{(i)} - x ^{(j)}}{2 i})^{2})) .

G (x)

G (x)

N_{G}

G (x) = (2 π)^{- D /2} exp (- \frac{1}{2} x_{i} x_{i})

G (x) = (2 π)^{- D /2} exp (- \frac{1}{2} x_{i} x_{i})

g (z)

g (z)

=

γ = \frac{1}{4 η ( η + 1 )}, \overset{γ}{ˉ} = \frac{2 η + 1}{4 η ( η + 1 )} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

REPRESENTATION OF COMPLEX PROBABILITIES††thanks: This work is supported in part by funds provided by the U.S.

Department of Energy (D.O.E.) under cooperative research agreement #DF-FC02-94ER40818 and Spanish DGICYT grant no. PB92-0927.

L.L. Salcedo***Email address: [email protected]

Center for Theoretical Physics

Laboratory for Nuclear Science

and Department of Physics

Massachusetts Institute of Technology

Cambridge, Massachusetts 02139, U.S.A.

and

Departamento de Física Moderna

Universidad de Granada

E-18071 Granada, Spain

Abstract

Let a “complex probability” be a normalizable complex distribution $P(x)$ defined on ${\mbox{\rm R}}^{D}$ . A real and positive probability distribution $p(z)$ , defined on the complex plane ${\mbox{\rm C}}^{D}$ , is said to be a positive representation of $P(x)$ if $\langle Q(x)\rangle_{P}=\langle Q(z)\rangle_{p}$ , where $Q(x)$ is any polynomial in ${\mbox{\rm R}}^{D}$ and $Q(z)$ its analytical extension on ${\mbox{\rm C}}^{D}$ . In this paper it is shown that every complex probability admits a real representation and a constructive method is given. Among other results, explicit positive representations, in any number of dimensions, are given for any complex distribution of the form Gaussian times polynomial, for any complex distributions with support at one point and for any periodic Gaussian times polynomial.

pacs:

PACS 11.15.Ha 02.70.Lq 02.60.Cb 02.50.Ey

I Introduction

In quantum physics there are instances of averages where the role of probability distribution is played by a distribution taking complex values. Consider the functional integral formulation of field theory [1]. There, the time ordered expectation value of observables takes de form $\langle T{\cal O}[\phi]\rangle=N\int{\cal D}\phi(x)\,e^{iS[\phi]}{\cal O}[\phi]$ , where $S[\phi]$ is the action functional and $N$ a normalization constant. This is a first instance of a “complex probability distribution”, namely, the Boltzmann weight $P[\phi]=Ne^{iS[\phi]}$ . In the continuum, such functional integral is not sufficiently well-behaved and only its Euclidean version can be given a rigorous meaning [2]. Within a lattice regularization, the Minkowski version is mathematically well-defined, nevertheless the Wick rotation is performed in this case too. This is because, in most cases, in the Euclidean theory the Boltzmann weight becomes a real and positive probability distribution. This is important in practice since straightforward Monte Carlo is only defined for positive probabilities. There are cases, however, when even Euclidean field theory presents complex actions. Indeed, the statistical interpretation of the quantum theory requires the Boltzmann weight to be reflection positive, but not directly positive [3]. Instances of complex Euclidean actions occur after integration of fermions, since the fermionic determinant is not positive definite; if there are non vanishing chemical potentials; in gauge theories in the presence of Wilson loops or topological $\theta$ -terms or in general after inserting projection operators in the path integral to select particular sectors of the theory [4, 5, 11, 7]. Also, two dimensional fermions can be brought to a bosonic complex action form [8].

As we have said, the computation of averages in the presence of a complex probability distribution poses a practical problem, namely, the Monte Carlo method cannot be used directly to sample the probability since this method only makes sense for true, i.e. real and positive, probabilities. The standard approach to complex probabilities in numerical simulations [4, 5] is to factorize a real and positive part to be used as input for some Monte Carlo method and include the remainder in the observable. That is, if the complex probability is $P(x)=P_{0}(x)F(x)$ with $P_{0}(x)$ positive, the expectation values can be obtained as

[TABLE]

Of course, the same formula can be used when $P(x)$ itself is positive. The problem with this approach is that it violates the importance sample principle, since we are not sampling the true probability and that increases the dispersion of Monte Carlo data. For instance, $\langle F(x)\rangle_{P_{0}}$ may be small, thereby introducing large error bars.

An alternative approach is to look for a positive probability $p(z)$ in the complex configuration space which gives the same expectation values as $P(x)$ , i.e., $\langle{\cal O}(x)\rangle_{P}=\langle{\cal O}(z)\rangle_{p}$ , where ${\cal O}(z)$ is the analytical extension of ${\cal O}(x)$ . The usual way of constructing such a probability is by means of the complex Langevin algorithm [9, 10]. In this approach the configuration is updated through a standard Langevin algorithm with the complex action. Since the drift term is complex, the complex extension of the configuration space is sampled as well. Whenever the random walk possesses an equilibrium configuration, it is sampling the complex configuration space with a real and positive probability distribution $p(z)$ . We have then traded a complex probability $P(x)$ on ${\mbox{\rm R}}^{D}$ by a positive probability $p(z)$ on ${\mbox{\rm C}}^{D}$ . If $p(z)$ happens to be equivalent to $P(x)$ in the sense of expectation values, we have succeeded in sampling the complex probability. Successful implementations of the algorithm have been obtained in some practical cases, such as two dimensional compact QED with static charges [6]. In general, however, the complex Langevin algorithm poses two problems. First, it not always converges to an equilibrium distribution. Second and more subtle, for some actions it seems to converge to an equilibrium distribution which is not equivalent to the original complex probability [11, 12, 13], (see however [14]). Such phenomenon has been found in practically relevant cases such as QCD with a Wilson loop [11, 12, 15].

In the present paper we consider the problem of constructing a positive representation directly, independently of the Langevin algorithm. Several properties of representations of complex probabilities on ${\mbox{\rm R}}^{D}$ by probabilities on ${\mbox{\rm C}}^{D}$ are noted. A constructive method is given to obtain real (although not necessarily positive) representations of very general complex probabilities. Positive representations are explicitly constructed for some probabilities which are beyond the present applicability of the complex Langevin algorithm. These include Gaussian times polynomial, distributions with support at one point, and periodic Gaussian times polynomial. In all cases, such representations are not unique.

These results are of great interest from the point of view of applications. This is not because the constructions found here are of direct usefulness to carry out numerical calculations; there are far more natural ways to compute expectations values with complex Gaussian times polynomial distributions. The interest lies in the following. The negative results found up to now with the complex Langevin algorithm in some systems would make one to have reasonable doubts of whether a positive representation exists at all for those systems. Moreover, the momenta of any positive probability on ${\mbox{\rm C}}^{D}$ are bounded to satisfy some inequalities among them. It might happen that those bounds were incompatible with the momenta of the given complex probability on ${\mbox{\rm R}}^{D}$ in some cases. At present, the necessary and sufficient conditions for a positive representation to exist are not known. The results of this paper suggest, however, that such representation exists quite generally since the set of Gaussian times polynomial is dense in $L^{2}({\mbox{\rm R}}^{D})$ . Our results tend to support the idea that there is no obstruction of principle for positive representations to exits. This is the main insight of this work.

II Representation of complex probabilities

The complex probabilities $P(x)$ to be considered here will be tempered distributions on ${\mbox{\rm R}}^{D}$ of a restricted class, namely, those which are the inverse Fourier transform of an ordinary function ${\tilde{P}}(k)$ (locally integrable and at most of polynomial growth at infinity), with ${\tilde{P}}(k)$ non vanishing at the origin and analytical at that point. These conditions allow for a natural definition of $\int x_{i_{1}}\cdots x_{i_{n}}P(x){\mbox{\rm d}}^{D}x$ through the Taylor expansion of ${\tilde{P}}(k)$ at $k=0$ . In particular $\int P(x){\mbox{\rm d}}^{D}x$ will be non vanishing. The expectation value associated to $P(x)$ is defined for any polynomial $Q(x)$ as

[TABLE]

Likewise, we can consider complex probabilities on ${\mbox{\rm C}}^{D}$ as the class of distributions defined above on ${\mbox{\rm R}}^{2D}$ . For any such distribution, $p(z)$ , the expectation value takes the form

[TABLE]

where $z_{j}=x_{j}+iy_{j}$ , ${\mbox{\rm d}}^{2D}z={\mbox{\rm d}}^{D}x{\mbox{\rm d}}^{D}y$ and $q(z)$ is an arbitrary polynomial of $z$ and its complex conjugate $z^{*}$ .

By definition, $p(z)$ is a representation of $P(x)$ if $\langle Q(x)\rangle_{P}=\langle Q(z)\rangle_{p}$ , where $Q(x)$ is any polynomial on ${\mbox{\rm R}}^{D}$ and $Q(z)$ its analytical extension on ${\mbox{\rm C}}^{D}$ . Equivalently, one can demand $\langle x_{i_{1}}\cdots x_{i_{n}}\rangle_{P}=\langle z_{i_{1}}\cdots z_{i_{n}}\rangle_{p}$ for any set of indices, where $i_{r}=1,\dots,D$ and $n=0,1,2,\dots$ . Two complex probabilities on ${\mbox{\rm C}}^{D}$ will be called equivalent if they have the same expectation values on every analytical polynomial. In general, two equivalent probabilities will not coincide on expectation values of non analytical polynomials $\langle z_{i_{1}}\cdots z_{i_{n}}z^{*}_{j_{1}}\cdots z^{*}_{j_{m}}\rangle$ . A representation will be called real if $p(z)$ is real, positive if $p(z)$ is non negative and unitary if $\int p(z){\mbox{\rm d}}^{2D}z=\int P(x){\mbox{\rm d}}^{D}x$ . Our goal is then to find positive representations of complex probabilities.

We will proceed by noting different ways to obtain new representations from known ones. A first obvious way is by means of complex affine transformations. Let $A$ be a non singular complex $D\times D$ matrix, and $a\in{\mbox{\rm C}}^{D}$ , and assume that $P_{0}(z)$ is an analytical function in a region including ${\mbox{\rm R}}^{D}$ and $A{\mbox{\rm R}}^{D}+a$ such that $P_{0}(x)$ and $P(x)=\det(A)P_{0}(Ax+a)$ are both complex probabilities. Then if $p_{0}(z)$ is a unitary representation of $P_{0}(x)$ so is $p(z)=|\det(A)|^{2}p_{0}(Az+a)$ of $P(x)$ : for any polynomial $Q(x)$

[TABLE]

Furthermore, $p(z)$ is positive if $p_{0}(z)$ is positive. Another construction follows from linear combination. If $p_{i}(z)$ are unitary representations of $P_{i}(x)$ , so is $p(z)=\sum_{i=1}^{n}b_{i}p_{i}(z)$ of $P(x)=\sum_{i=1}^{n}b_{i}P_{i}(x)$ . Again, if $p_{i}(z)$ are positive and $b_{i}$ non negative, $p(z)$ is positive too.

Let us define the partial derivatives $\partial_{k}$ and $\partial^{*}_{k}$ on a function on ${\mbox{\rm C}}^{D}$ as $(\partial/\partial x_{k}\mp i\partial/\partial y_{k})/2$ , respectively and let $\phi(z)$ be in the class of distributions on ${\mbox{\rm C}}^{D}$ defined above but dropping the restriction $\int\phi(z){\mbox{\rm d}}^{2D}z\not=0$ . Then if $p(z)$ is a probability, $p(z)+\partial^{*}_{k}\phi(z)$ is also a probability and in fact (unitarily) equivalent to $p(z)$ ,

[TABLE]

where $Q(z)$ is any analytical polynomial. That is, $\partial^{*}_{k}\phi(z)$ would represent the zero distribution on ${\mbox{\rm R}}^{D}$ . Such distributions will be called null distributions. They will prove useful in what follows to obtain positive representations from real ones, namely, by adding null distribution of the form $\sum_{k=1}^{D}\partial_{k}\partial^{*}_{k}\phi_{k}(z)$ , for suitably chosen real $\phi_{k}(z)$ . Note that $4\partial_{k}\partial^{*}_{k}$ is just a Laplacian.

Similarly, by proceeding as in eq. (6), it follows that if $p(z)$ represents $P(x)$ , the following relations hold

[TABLE]

where $Q(x)$ and $R(x)$ are arbitrary polynomials. That is, $\partial_{k}$ on ${\mbox{\rm C}}^{D}$ represents $\partial_{k}$ on ${\mbox{\rm R}}^{D}$ and multiplication by an analytical polynomial $R(z)$ represents multiplication by $R(x)$ .

Another interesting construction is related to convolutions. The convolution exist for any two complex probabilities since it can be defined through the product of their Fourier transforms which are regular distributions. If $p_{1}(z)$ and $p_{2}(z)$ are unitary representations of $P_{1}(x)$ and $P_{2}(x)$ respectively, their convolution $p_{1}*p_{2}$ is a unitary representation of $P_{1}*P_{2}$ . Indeed, $p_{1}\otimes p_{2}$ is a unitary representation of $P_{1}\otimes P_{2}$ and

[TABLE]

Furthermore, if $p_{1}(z)$ and $p_{2}(z)$ are positive, $p_{1}*p_{2}$ is positive too. In particular, this allows for obtaining equivalent representations of known ones: if $p(z)$ is a unitary representation of $P(x)$ and $C(z)$ is a unitary representation of $\delta(x)$ , the $D$ -dimensional Dirac delta function, $p*C$ will be unitarily equivalent to $p(z)$ , since $P*\delta=P$ . Any probability $C(z)$ normalized to one defines a unitary representation of $\delta(x)$ if it is invariant under global phase rotations, i.e., $C(e^{i\varphi}z)=C(z)$ for any $\varphi\in{\mbox{\rm R}}$ . In this case

[TABLE]

since the angular average of $z_{i_{1}}\cdots z_{i_{n}}$ vanishes for $n>0$ . In fact this construction can be regarded as adding a Laplacian, namely, $p*C-p$ , as it is easily seen after Fourier transform. This procedure can be used to obtain positive representations from real ones. On the other hand, it shows that if a complex probability admits a unitary positive representation it is not unique.

A unitary representation can always be obtained for any $P(x)$ by taking $p(z)=P(x)\delta(y)$ . If $P(x)$ is positive so will be $p(z)$ . This can be generalized as follows. Let $P_{0}(x)$ be positive and $P(x)=P_{0}(x-it)$ , $t\in{\mbox{\rm R}}^{D}$ (i.e., a complex translation under the conditions considered above for affine transformations). Then $p(z)=P_{0}(x)\delta(y-t)$ is a unitary positive representation of $P(x)$ . If we allow $P_{0}$ to depend on $t$ , taking linear combinations we obtain that $p(z)=p(x,y)$ is a unitary representation of

[TABLE]

This relation has been noted before in the literature [16, 13], considered as a projection from probabilities on ${\mbox{\rm C}}^{D}$ to probabilities on ${\mbox{\rm R}}^{D}$ . Note, however, that when this relation can be applied it gives just one of the $P(x)$ represented by $p(z)$ . In fact, since the momenta of $P(x)$ are the Taylor expansion coefficients of its Fourier transform, there are many complex probabilities characterized by the same momenta. As we have seen, under this projection, the operation $\partial^{*}_{i}$ is mapped to zero. Similarly, $\partial_{i}$ is mapped to $\partial/\partial x_{i}$ , and multiplication by an analytical polynomial $Q(z)$ is mapped to multiplication by $Q(x)$ .

As an immediate application of eq. (12), we find that for $m_{ij}$ real, symmetric and positive definite, the probability

[TABLE]

is represented by

[TABLE]

where ${\tilde{f}}$ is the Fourier transform of $f$ (the repeated index convention will be used in what follows). For example, for $D=1$ , and $\Gamma$ positive, $P(x)=\cos(x)\exp(-x^{2}/2\Gamma)$ is represented by the positive probability $p(z)=\exp(-x^{2}/2\Gamma)(\delta(y-\Gamma)+\delta(y+\Gamma))$ . Since in this example $P(x)$ is real but not positive definite, this is an instance where a complex Langevin simulation would fail [12, 15, 13], yet there is a positive representation.

Next, let us show that every complex probability on ${\mbox{\rm R}}^{D}$ admits a real representation. Let $P(x)$ be a complex probability normalized to one and ${\tilde{P}}(k)$ its Fourier transform

[TABLE]

where $kx=k_{i}x_{i}$ . By definition we have

[TABLE]

in a neighborhood of $k=0$ since ${\tilde{P}}(k)$ is analytic at the origin. Also,

[TABLE]

For a probability $p(z)$ on ${\mbox{\rm C}}^{D}$ , the Fourier transform is defined similarly,

[TABLE]

where $\sigma_{i}=k_{i}+ir_{i}$ . Assuming that $p(z)$ is normalized to one, its momenta are obtained through

[TABLE]

where $\partial_{i}$ refers to $\sigma_{i}$ and $\partial^{*}_{j}$ to $\sigma^{*}_{j}$ . Consider the following probability,

[TABLE]

Here $C(z)$ is one of the real unitary representations of $\delta(x)$ above mentioned. Thus ${\tilde{C}}(\sigma)$ is analytical at the origin as a function of $k_{i}$ and $r_{i}$ and is invariant under global phase rotations of $\sigma$ . ${\tilde{P}}(\sigma)$ stands for the analytical extension of ${\tilde{P}}(k)$ in a neighborhood of the origin. Beyond the analyticity circle (if it is finite) we can choose ${\tilde{C}}(\sigma)$ equal to zero so that ${\tilde{p}}(\sigma)$ exists. By construction, ${\tilde{p}}(\sigma)$ is unity at the origin and analytical there. Also it is locally integrable and, with a suitable choice of ${\tilde{C}}(\sigma)$ , grows at most polynomically at infinity, therefore it defines a probability $p(z)$ on ${\mbox{\rm C}}^{D}$ . Furthermore, $p(z)$ is real since $C(z)$ is real and $({\tilde{p}}(\sigma))^{*}={\tilde{p}}(-\sigma)$ . It remains to show that it is a representation of $P(x)$ ,

[TABLE]

where it has been used that $\partial^{*}_{i_{1}}\cdots\partial^{*}_{i_{n}}{\tilde{C}}(\sigma)\big{|}_{\sigma=0}$ vanishes for $n>0$ . That is, we have given a constructive method, eq. (20), to obtain a real representation of any complex probability within the class of complex probabilities considered.

As an illustration, consider $D=1$ and

[TABLE]

In this case ${\tilde{P}}(\sigma)=1-ia\sigma$ is a polynomial, thus it is entire and well-behaved at infinity and we can take ${\tilde{C}}(\sigma)=1$ , i.e., $C(z)=\delta(x)\delta(y)$ . With this choice

[TABLE]

and

[TABLE]

One can easily check that this is a real distribution which represents $P(x)$ , however it is not positive. We can find a positive representation by first applying a convolution (i.e., a better choice of $C(z)$ ) and then adding a suitable Laplacian. Furthermore, it can be done for an arbitrary distribution of support at zero in any number of dimensions. Rather than showing this in detail here, it will be obtained as a byproduct in the next section. There we will obtain positive representations of Gaussian functions times polynomials.

By formally undoing the Fourier transform of ${\tilde{p}}(\sigma)$ in eq. (20), the following explicit form of $p(z)$ is obtained

[TABLE]

where $C_{0}(z_{1},z_{2})$ is the analytical extension of $C_{0}(x,y)=C(x+iy)$ , with $x$ and $y$ real. In order for this formula to make sense, we should require $C_{0}(z_{1},z_{2})$ to be entire on ${\mbox{\rm C}}^{2D}$ and further the integrand should be sufficiently convergent so as to define a probability on ${\mbox{\rm C}}^{D}$ . Such probability is real by construction, since $C(z)$ is real, however it will not be positive in general even if $C(z)$ is positive since such property is lost after analytical extension. The interest of this relation, as compared, for instance with that in eq. (12), is that it is constructive.

An example of application of this formula is provided by

[TABLE]

which gives

[TABLE]

Another application is when $P(x)$ is a finite linear combination of Gaussian distributions centered anywhere in the complex plane and with arbitrary complex widths, provided we choose $\Gamma>|\Gamma_{i}|$ , $i=1,\dots,N$ .

III Positive representations of Gaussian distributions

A Gaussian complex probability takes the general form

[TABLE]

where $m_{ij}$ is a symmetric complex matrix with positive definite real part to ensure normalizability. As a consequence $m_{ij}$ is non singular and can be written as $A_{ki}A_{kj}$ . This allows to set $m_{ij}=\delta_{ij}$ and $b_{i}=0$ by means of a complex affine transformation. That is, we will consider only

[TABLE]

and the general case can be obtained a posteriori as $G(Ax+A^{-1}b)$ . A positive representation of $G(x)$ is simply $G(x)\delta(y)$ . A more general representation $g(z)$ is obtained by convolution with $C(z)=(2\pi\eta)^{-D}\exp(-\frac{1}{2\eta}z_{i}z^{*}_{i})$ , where $\eta$ is positive. This gives

[TABLE]

where the normalization constant is $N_{g}=\left(2\pi\sqrt{\eta(\eta+1)}\right)^{-D}$ and we have introduced the positive numbers

[TABLE]

The same representation is obtained by following the method of eq. (26). The value of the parameter $\eta$ , or equivalently $\bar{\gamma}$ , will be fixed below.

The set of probabilities to be considered is $P(x)=Q(x)G(x)$ , where $Q(x)$ is a complex polynomial of degree $N$ . $P(x)$ can always be written as

[TABLE]

where $a_{i_{1}\dots i_{n}}$ is completely symmetric and the zeroth order coefficient $a_{0}$ must not vanish (in fact, is unity if $P(x)$ is normalized). A real representation of $P(x)$ is given by

[TABLE]

since the terms with $\partial^{*}$ do not contribute and $\partial/\partial z$ is mapped to $\partial/\partial x$ under projection.

It is convenient to introduce the polynomials

[TABLE]

They can be computed recursively by means of the formula

[TABLE]

where we have introduced the variable

[TABLE]

The functions $Q_{i_{1}\dots i_{n}}(z)$ are polynomials of degree $n$ in $\omega_{i}$ , with coefficients depending only on $\gamma$ . With this notation, $p_{0}(z)$ can be rewritten as

[TABLE]

In order to obtain a positive representation, $p(z)$ can be further cast in the form

[TABLE]

where the indices $i_{1}\dots i_{n}$ are summed over. $\beta_{1},\dots,\beta_{N}$ are arbitrary positive numbers which value is to be specified below and we have defined the quantity $|a_{n}|$ as

[TABLE]

We will assume that $|a_{n}|$ is non vanishing, since the vanishing case is trivial. In Appendix A it is shown that

[TABLE]

is a null distribution, where

[TABLE]

By removing $\phi_{n}(z)$ from $p_{0}(z)$ we obtain an equivalent representation $p(z)$ , namely,

[TABLE]

To ensure positivity of $p(z)$ we require

[TABLE]

This can be achieved by choosing the positive coefficients $\beta_{n}$ so as to minimize the left-hand side,

[TABLE]

In this way the inequality is satisfied for any $\bar{\gamma}$ smaller than the unique positive solution of

[TABLE]

For this choice of $\bar{\gamma}$ , $p(z)$ takes the simple form

[TABLE]

To summarize, any Gaussian times polynomial complex probability, eq. (35), admits a positive representation, namely, $p(z)$ in eq. (50), with $\beta_{n}$ given by eq. (48), and $\bar{\gamma}$ given by eq. (49).

Incidentally, let us note that from a computational point of view, it is convenient to minimize the width of $p(z)$ in the complex plane (e.g., if $P(x)$ is already positive, the best choice is $P(x)\delta(y)$ ), since this reduces the dispersion of points in the sample. In the family of probabilities described by the expression of $p(z)$ in eq. (46), this minimization corresponds to our choice of $\beta_{n}$ in eq. (48) and $\bar{\gamma}$ in eq. (49). In general, however, this needs not be best equivalent positive representation of $P(x)$ . The construction presented above corresponds to adding to $p_{0}(z)$ a Laplacian of the form $\partial_{i_{1}}\cdots\partial_{i_{n}}\partial^{*}_{i_{1}}\cdots\partial^{*}_{i_{n}}g(z)$ (as can be seen using the formulas of Appendix A). More generally, one could add terms of the form $b_{i_{1}\dots i_{n};j_{1}\dots j_{n}}\partial_{i_{1}}\cdots\partial_{i_{n}}\partial^{*}_{j_{1}}\cdots\partial^{*}_{j_{n}}g(z)$ , with $b$ self-adjoint, in order to optimize $p(z)$ , or even more general terms so long as they have a $\partial^{*}_{j}$ and are real.

Let us now come back to the problem of finding positive representations of complex distributions with support at 0. Such distributions take the form

[TABLE]

This distribution can be considered as the zero width limit of the Gaussian times polynomial distribution.

[TABLE]

Naming $P(x;a)$ the probability in eq. (35), we find

[TABLE]

Therefore, the positive representation of $P(x;a)$ , namely, $p(z;a)$ in eq. (50), provides a positive representation of $P_{\lambda}(x)$ ,

[TABLE]

In order to take the limit, we should consider how the different variables scale. We already have the scaling law of $z$ and of the coefficients $a_{i_{1}\dots i_{n}}$ . From eqs. (48,49) $\beta^{\lambda}_{n}$ is found to scale as $\lambda^{-2n}\beta_{n}$ and $\bar{\gamma}^{\lambda}$ as $\lambda^{2}\bar{\gamma}$ . From eqs. (34), $\eta^{\lambda}$ is given in leading order by $\lambda^{-2}\eta$ with $\eta=1/(2\bar{\gamma})$ and $\gamma^{\lambda}$ is of order $\lambda^{4}$ and can be neglected. Therefore, in leading order $\lambda^{-2D}g(z/\lambda;\bar{\gamma}^{\lambda})$ becomes

[TABLE]

and is independent of $\lambda$ . This results is to be used in eq. (50). Finally, in leading order, $Q_{i_{1}\dots i_{n}}(z/\lambda;\bar{\gamma}^{\lambda})$ becomes $\lambda^{n}Q^{0}_{i_{1}\dots i_{n}}(z;\bar{\gamma})$ with

[TABLE]

To summarize, any complex distribution with support at a single point, eq. (51), admits a positive representation, namely,

[TABLE]

with $\beta_{n}$ given by eq. (48), and $\bar{\gamma}$ given by eq. (49).

As an illustration we can consider again the distribution of eq. (23). In this case we find $\bar{\gamma}=(4|a|^{2})^{-1}$ and $\eta=\beta_{1}=2|a|^{2}$ , and thus

[TABLE]

As a final application of the results of this section, we can consider periodic probabilities. Such probabilities correspond to variables effectively defined in a compact domain and find application in the context of compact gauge theories on the lattice. They satisfy, $P(x)=P(x-na)$ with $(na)_{i}=n_{i}a_{i}$ where $n\in{\mbox{\rm Z}}^{D}$ is arbitrary and $a\in{\mbox{\rm R}}_{+}^{D}$ is characteristic of $P(x)$ . Without loss of generality, we may choose $a_{i}=2\pi$ . These probabilities do not belong to the class previously considered. The normalization as well as the expectation values should be taken on a lattice cell $\{x,0\leq x_{i}<2\pi,i=1,\dots,D\}$ . The test functions should be periodic and the concept of representation should be modified accordingly: $p(z)$ is periodic on the real axis, $x$ is to be integrated on the periodic cell and $y$ on ${\mbox{\rm R}}^{D}$ . Also instead of equality of expectation values of polynomials we demand $\langle\exp(in_{j}x_{j})\rangle_{P}=\langle\exp(in_{j}z_{j})\rangle_{p}$ for any integers $n_{j}$ , $j=1,\dots,D$ . Assume now that the periodic distribution is a function of the form

[TABLE]

where the series is uniformly convergent. Let $p_{0}(z)$ be a function which is a positive representation of $P_{0}(x)$ not only on polynomials but also on exponential test functions, and such that

[TABLE]

is uniformly convergent. Then, $p(z)$ is a positive representation of $P(x)$ , as is readily shown.

In particular, $P_{0}(x)$ may be a Gaussian times polynomial and $p_{0}(z)$ its positive representation found above, since these functions are sufficiently convergent at infinity. Therefore the construction given above provides a positive representation for this case too. Another example is the periodic version of the one dimensional Gaussian times cosine considered above after eq. (14):

[TABLE]

This example is interesting since it is similar to simplified probabilities considered in the literature [12, 15] to model the SU(2) gauge theory in the presence of a Wilson loop, for which the complex Langevin algorithm did not work.

IV Concluding remarks

We have studied the problem of representation of complex distributions by distributions on the analytically extended complex plane. The positive representation problem is of immediate interest in some areas of physics: field theory and statistical mechanics. On the other hand it also seems a new and interesting field from the mathematical point of view. One could consider extending the particular class of complex distributions studied here, namely, Fourier transforms of regular distributions analytical at the origin, by allowing as well for adding non regular distributions with support outside the origin. Perhaps more interesting, and in the opposite direction, one could extend the set of test functions in the definition of representation beyond polynomials to insure, for instance, that each probability on ${\mbox{\rm C}}^{D}$ is at most the representation of one probability on ${\mbox{\rm R}}^{D}$ . From the viewpoint of applications it would also be interesting to extend the concept of representations to distributions defined on group manifolds since they appear naturally in lattice gauge theories. Our discussion on periodic distributions corresponds in fact to the manifold of the direct product of $D$ U(1) factors.

V Acknowledgments

I would like to thank C. García-Recio for comments on the manuscript.

A

In this appendix we will show that $\phi_{n}(z)$ defined in eq. (44) is a null distribution. To this end let us introduce the polynomials

[TABLE]

They generalize $Q_{i_{1}\dots i_{n}}(z)$ and satisfy the relation

[TABLE]

To prove eq. (44), we will use the following Wick theorem:

[TABLE]

where the sum is over all possible sets of contractions of the indices $i_{1}\dots i_{n}$ with the indices $j_{1}\dots j_{m}$ . The contraction of two indices $i$ , $j$ gives a factor $\bar{\gamma}\delta_{ij}$ and removes them from the list, e.g.,

[TABLE]

In general there are $n!m!/k!(n-k)!(m-k)!$ terms with $k$ contractions. Let us apply the Wick theorem to $Q_{i_{1}\dots i_{n}}(z)Q^{*}_{j_{1}\dots j_{n}}(z)g(z)$ . Whenever two indices $i,j$ are not contracted we will have $Q_{i\dots;j\dots}(z)g(z)$ which contains $\partial^{*}_{j}$ and hence is a null distribution. Therefore only the terms with all indices contracted contribute and the non null part is

[TABLE]

where the sum runs over all permutations. After contracting the indices we obtain eq. (44). $K_{n}(D)$ is the number of ways of choosing $n$ objects out of $D$ allowing repetitions.

The Wick theorem can be proven by induction. Defining the operator

[TABLE]

( $g(z)$ is a multiplicative operator here) we have

[TABLE]

where $Q_{0}(z)=1$ . Trivially, $[\partial_{i},{\cal D}^{*}_{j}]=-\bar{\gamma}\delta_{ij}$ , thus

[TABLE]

where the hat means that the index has been removed from the list. On the other hand ${\cal D}(AB)=({\cal D}A)B-A\partial B$ . The Wick theorem holds for $n=m=0$ . Assuming it has been proven up to some $(n,m)$ ,

[TABLE]

Using that the theorem holds for $(n,m)$ and eq. (A9),

[TABLE]

The first term contains all the contractions not involving the index $i_{n+1}$ , and the second one all the contractions involving the index $i_{n+1}$ , hence the theorem is proven for $(n+1,m)$ . It is worth noticing that the reverse expansion also holds, i.e.,

[TABLE]

where the contraction of $ij$ now is $-\bar{\gamma}\delta_{ij}$ .

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] P. Ramond, “Field theory: a modern primer”, (Addison Wesley. 1990).
2[2] J. Glimm and A. Jaffe, “Path integral approach to quantum physics”, (Springer Verlag, 1994).
3[3] K. Osterwalder and R. Schrader, Comm. Math. Phys. 31, 83 (1973); ibid. 42, 281 (1974).
4[4] M. Fukugita and I. Niuya, Phys. Lett. B 132, 374 (1983).
5[5] G. Bhanot, R. Dashen, N. Seiberg and H. Levine, Phys. Rev. Lett. 53, 519 (1984).
6[6] J. Ambjørn, M. Flensburg and C. Peterson, Phys. Lett. B 159, 335 (1985).
7[7] J. Ambjørn and S.-K. Yang, Nucl. Phys. B 275 [FS 17], 18 (1986).
8[8] J.R. Klauder and S. Lee, Phys. Rev. D 45, 2101 (1992).