A dynamic model for the two-parameter Dirichlet process
Shui Feng, Wei Sun

TL;DR
This paper develops a stochastic dynamic model for the two-parameter Dirichlet process, establishing the existence of a reversible diffusion process with this distribution as its stationary measure, and discusses properties including Mosco convergence.
Contribution
It constructs a diffusion process associated with the two-parameter Dirichlet process on general spaces and proves its key properties, including closability and Mosco convergence.
Findings
The bilinear form is closable and its closure is a quasi-regular Dirichlet form.
The associated diffusion process is time-reversible with the Dirichlet process as stationary distribution.
Proves Mosco convergence of projection forms for general spaces.
Abstract
Let , , and be a probability measure on a type space . In this paper, we investigate the stochastic dynamic model for the two-parameter Dirichlet process . If , we show that the bilinear form \begin{eqnarray*} \left\{ \begin{array}{l} {\cal E}(F,G)=\frac{1}{2}\int_{{\cal P}_1(\mathbb{N})}\langle \nabla F(\mu),\nabla G(\mu)\rangle_{\mu} \Pi_{\alpha,\theta,\nu_0}(d\mu),\ \ F,G\in {\cal F},\\ {\cal F}=\{F(\mu)=f(\mu(1),\dots,\mu(d)):f\in C^{\infty}(\mathbb{R}^d), d\ge 1\} \end{array} \right. \end{eqnarray*} is closable on and its closure is a quasi-regular Dirichlet form. Hence is associated with a diffusion process in which is time-reversible with the stationary distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Stochastic processes and statistical mechanics · Mathematical Dynamics and Fractals
A dynamic model for the two-parameter
Dirichlet process
Shui Feng
Department of Mathematics and Statistics, McMaster University,
Hamilton, L8S 4K1, Canada
E-mail: [email protected]
Wei Sun
Department of Mathematics and Statistics, Concordia University,
Montreal, H3G 1M8, Canada
E-mail: [email protected]
Abstract Let , , and be a probability measure on a type space . In this paper, we investigate the stochastic dynamic model for the two-parameter Dirichlet process . If , we show that the bilinear form
[TABLE]
is closable on and its closure is a quasi-regular Dirichlet form. Hence is associated with a diffusion process in which is time-reversible with the stationary distribution . If is a general locally compact, separable metric space, we discuss properties of the model
[TABLE]
In particular, we prove the Mosco convergence of its projection forms.
Keywords Two-parameter Dirichlet process, dynamic model, Dirichlet form, closability, Mosco convergence.
1 Introduction
For any and , let , , be a sequence of independent random variables such that has distribution. Set
[TABLE]
and let denote in descending order. The distribution of is called the two-parameter GEM distribution, denoted by . The law of is called the two-parameter Poisson-Dirichlet distribution, denoted by ([17]). For a locally compact, separable metric space , and a sequence of i.i.d. -valued random variables , , with common distribution on , let
[TABLE]
Hereafter, we denote by the Dirac delta measure at for . The distribution of , denoted by or , is called the two-parameter Dirichlet process. Both and carry the information on proportions only while contains information on both proportions and types or labels.
The two-parameter models are natural generalizations to the case . Specifically , and correspond to the well known Poisson-Dirichlet distribution, the GEM distribution and the Dirichlet process, respectively. The Poisson-Dirichlet distribution was introduced by Kingman in [11] to describe the distribution of gene frequencies in a large neutral population at a particular locus. The component represents the proportion of the -th most frequent allele. The age-ordered proportions follow the GEM distribution. The Dirichlet process first appeared in [8] in the context of Bayesian statistics. It is a pure atomic random measure with masses distributed according to . In the context of population genetics, both the Poisson-Dirichlet distribution and the Dirichlet process appear as approximations to the equilibrium behavior of certain large populations evolving under the influence of mutation and random genetic drift.
Let
[TABLE]
denote the infinite dimensional ordered simplex and
[TABLE]
be the closure of in the product space . In [4] an infinite dimensional diffusion process, the unlabeled infinitely-many-neutral-alleles model, is constructed on with generator
[TABLE]
defined on an appropriate domain. The reversible measure of this process is shown to be .
Let , , , and for . Hereafter, we denote by the set of all bounded Borel measurable functions on , the set of all infinitely differentiable functions on , and the space of all probability measures on the Borel -algebra in . For and , we define
[TABLE]
We write for the function . For , define
[TABLE]
Given we consider the operator of the form
[TABLE]
Then, the Fleming-Viot process (cf. [9] and [5]) with neutral parent independent mutation or the labeled infinitely-many-neutral-alleles model is a pure atomic measure-valued Markov process with generator
[TABLE]
For compact space and diffuse probability , i.e., for every in , it is known ([3]) that the labeled infinitely-many-neutral-alleles model is time-reversible with reversible measure .
It is natural to ask whether these diffusion processes have two-parameter analogues when is positive. Many progresses have been made in this direction over the last decade. In [7], a class of infinite dimensional reversible diffusions is constructed and the reversible measure is . The unlabeled infinitely-many-neutral-alleles model in [4] is generalized to the two-parameter setting in [15] where the generator of the process on appropriate domain has the form
[TABLE]
and the reversible measure turns out to be . The process, called Petrov diffusion, is derived as the continuum limit of a family of up-down Markov chains involving the Chinese restaurant process. Connections to Bayesian statistics and ecology are explored in [18] and [19]. Going back to the context of population genetics, the Petrov diffusion is constructed recently in [2] from a family of the Wright-Fisher diffusions with special selection scheme. In [10], two interval partition-valued diffusions are constructed and the corresponding stationary distributions are and , the two cases that are connected to the excursion intervals of Brownian motion and Brownian bridge ([14], [16]).
The situation is more complex in the construction of the labelled diffusion processes in the two-parameter setting. The only model we know of is the one in [6] where the type space consists of two types. In the case , the Dirichlet process has the partition property, i.e., projection of on any finite partition of the type space is a Dirichlet distribution. Exploring the connection between the Wright-Fisher diffusion and the Dirichlet distribution one can naturally construct the Fleming-Viot process from the finite-dimensional Wright-Fisher diffusions. When is positive, the projection on any finite partition of has a complicated distribution in general, and finite dimensional diffusion models are no longer available.
The main objective of this paper is to find a labelled reversible diffusion process with as the reversible measure for certain positive . This can be viewed as a two-parameter generalization of the Fleming-Viot process with parent independent mutation. The range of parameters we consider throughout the paper is and .
In Section 2, we construct the process when the base measure has countable support. Since the partition property does not hold, we will explore the partition structure through Dirichlet forms. This allows us to avoid certain exceptional sets that cause problems in the representation of generators. In Section 3, we consider the general type space with diffuse base measure. We first show that cylindrical functions do not belong to the domain of the pre-generator of the classical bilinear form. To establish the closability, we consider the relaxation of the bilinear form. The process is then constructed, when is a compact Polish space, by taking the Mosco limit.
2 Dynamic model with atomic base distribution
Throughout this section, let , the set of all natural numbers. We consider the bilinear form
[TABLE]
Theorem 2.1
The bilinear form is closable on and its closure is a quasi-regular Dirichlet form. The diffusion process associated with is time-reversible with the stationary distribution .
Before proving Theorem 2.1, we make some preparation.
For , we define
[TABLE]
Denote , , and . For , we define
[TABLE]
Denote . Following the argument of [1, proof of Lemma 3.1], we get
[TABLE]
where is a positive constant depending only on and . Further, we obtain by symmetry that for each ,
[TABLE]
where is a positive constant depending only on and .
Denote and the interior of . For , we define
[TABLE]
If for some , we regard as a function in by setting for . By (2.6), there exists a constant , which depends on , and is independent of , such that for any ,
[TABLE]
[TABLE]
where is a positive constant depending only on , and .
Proof of Theorem 2.1. Let . We consider the map
[TABLE]
By [1, Theorem 3.1], we have . The induced bilinear form of by the map is given by
[TABLE]
For and , we define
[TABLE]
and
[TABLE]
Denote by the boundary of , the outward pointing unit normal field of , and the induced volume form on the surface . For the face , , we have
[TABLE]
and for the face , we have
[TABLE]
Hence on . Then, we obtain by (2.8) and the divergence theorem that
[TABLE]
Thus, we have
[TABLE]
Now we use the estimate (2.8) to show that is closable on . To this end, let be a sequence satisfying
[TABLE]
Note that
[TABLE]
To show , we need only show that for any fixed ,
[TABLE]
Suppose that with and . By (2.8) and (2.10), we get
[TABLE]
Thus, is closable on .
Following the argument of [20, Proposition 5.11 and Lemma 7.5], we can show that the closure of is a quasi-regular, symmetric, local Dirichlet form on . Therefore, there exists an associated diffusion process in which is time-reversible with the stationary distribution .
Denote by the generator of on . In the following, we will give an explicit expression for .
Theorem 2.2
(i)* .*
(ii)* For each ,*
[TABLE]
(iii)* For , denote by the -limit given in (2.11). Let with and . We have*
[TABLE]
Proof. Let for some and . For with and , we obtain by (2.8) that
[TABLE]
Since is arbitrary, we conclude that by [12, Chapter I, Proposition 2.16].
For , we regard as a function in by setting for . We claim that
[TABLE]
In fact, it is easy to see that
[TABLE]
where is the orthogonal projection of onto the closure of . Since is dense in , we obtain (2.13).
For , let for . By (2.13), we get
[TABLE]
Hence, (2.11) holds and . Therefore, we obtain (2.12) by (2.6) and (2.13).
Remark 2.3
For and , we have
[TABLE]
The eigenvalues of are with multiplicity 1, . It deserves further investigation to characterize the eigenvalues of for .
3 Dynamic model with diffuse base distribution
In this section, let be a general locally compact, separable metric space and a diffuse probability measure on . We consider the classical bilinear form
[TABLE]
If is closable on , then following the argument of [20, Proposition 5.11 and Lemma 7.5], we can show that the closure of is a quasi-regular, symmetric, local Dirichlet form on . Therefore, there exists an associated diffusion process in which is time-reversible with the stationary distribution .
Up to now we still cannot prove that is closable on . In the following, we will discuss properties of the model (3.3). We fix a sequence of partitions of satisfying the following conditions:
(1) , , .
(2) , , .
3.1
Denote by the pre-generator of on . A special feature of the model (3.3) is that . More precisely, we have the following result.
Proposition 3.1
Suppose that . Let . There does not exist such that
[TABLE]
Proof. Let and . For , we define
[TABLE]
Set , . We define as in (2.6). Then, we have
[TABLE]
Following the argument of [1, proof of Lemma 3.1], we get
[TABLE]
and
[TABLE]
Further, we obtain by symmetry that
[TABLE]
and
[TABLE]
[TABLE]
Suppose there exists such that (3.4) holds. For , we define
[TABLE]
and
[TABLE]
Let for some . By (3.4), we get
[TABLE]
Since is arbitrary, we obtain by (3.8) and (3.10) that
[TABLE]
Since is arbitrary, there is a contradiction. Therefore, there does not exist such that (3.4) holds.
3.2 Mosco convergence of projection forms
Since we do not know if is closable on , we consider the relaxation of . By [13, page 373], there exists a greatest lower semicontinuous bilinear form on which is a minorant of . This unique determined closed form is called the relaxation of , denoted by . We have that and for any , and for every ,
[TABLE]
Note that if is closable, then is just the closure of on .
By [13, Corollary 2.8.2], is a Dirichlet form on . Further, if is a compact Polish space, then is a regular Dirichlet form. Hence is associated with a Markov process in which is time-reversible with the stationary distribution . Let be defined as in (3.9) for . In this subsection, we will show that the limit of the sequence is given by .
For , we consider the map
[TABLE]
For with and with , we obtain by (2.10) that
[TABLE]
Hence is closable on by [12, Chapter I, Proposition 3.3]. Denote by the closure of . We have and is an extension of for each .
For , we define the resolvent of by
[TABLE]
where for . Given , the existence and uniqueness of satisfying (3.12) follows from the Riesz representation theorem.
Denote by the strongly continuous contraction resolvent associated with the Dirichlet form on . We have the following characterization of by virtue of .
Theorem 3.2
For every , the sequence of resolvent operators converges to in the strong operator topology.
Proof. We first show that for any subsequence of , there exists a subsequence of such that for every the sequence converges to a resolvent operator.
For , we define
[TABLE]
Denote
[TABLE]
Let be the set of all positive rational numbers. Note that
[TABLE]
By the diagonal argument, there exists a subsequence of such that
[TABLE]
We fix such a subsequence and define
[TABLE]
Let . For and , we have
[TABLE]
Hence
[TABLE]
Thus,
[TABLE]
[TABLE]
For every , by (3.17), we can extend to a continuous contraction operator on . Further, by (3.16), the resolvent equations for , and the density of in , we can obtain a collection of continuous operators on satisfying
(i) .
(ii) .
(iii) .
Let and . By (3.15), we find that is an increasing sequence. Then, we obtain by (3.16) that
[TABLE]
By (i) and (3.18), we get
[TABLE]
Then, we have
[TABLE]
Thus, we obtain by (i), (iii), and the density of in that
(iv) .
By (i), (iii), and (iv), we know that is a strongly continuous contraction resolvent on (cf. [12, Chapter I, Definition 1.4]). Then, there exists a unique symmetric Dirichlet form on such that its resolvent is given by , i.e.,
[TABLE]
By (ii) and [13, Theorem 2.4.1], we find that converges to in the sense of Mosco convergence as , i.e.,
(a) For every converging weakly to in ,
[TABLE]
(b) For every , there exists converging strongly to in , such that
[TABLE]
By (a), we know that is a minorant of . By (b) and (3.11), we obtain that and for . Since is the greatest closed form on which is a minorant of , we get . Then, we obtain by (ii) that
[TABLE]
Since the subsequence of is arbitrary, we get
[TABLE]
As a direct consequence of Theorem 3.2 and [13, Theorem 2.4.1], we obtain the Mosco convergence of projection forms of the model (3.3).
Corollary 3.3
The sequence of bilinear forms converges to in the sense of Mosco convergence.
For , we define the bilinear form as in (2.9). By (2.10), we know that is closable on . Denote by the closure of , the semigroup associated with on , and the orthogonal projection of onto the closure of . For and , we define
[TABLE]
Then, is the semigroup associated with the bilinear form on .
Denote by the strongly continuous contraction semigroup associated with the Dirichlet form on . We have the following characterization of by virtue of .
Theorem 3.4
For every , the sequence of semigroup operators converges to in the strong operator topology.
Proof. Let be a subsequence of . By the diagonal argument, there exists a subsequence of such that
[TABLE]
where is defined as in (3.13). We define
[TABLE]
By the density of in and the contraction of the semigroup operators , we can extend to a collection of contraction linear operators on such that
[TABLE]
By (3.20), we find that
[TABLE]
Hence, for any and , we have
[TABLE]
By (3.21) and (3.2), we know that
[TABLE]
is well-defined. Moreover, by (3.23), we can show that is a collection of contraction linear operators on .
By (3.21) and (3.23), we find that
[TABLE]
Hence, there exists a collection of subsets of such that
[TABLE]
For , we obtain by (3.20) and (3.23) that
[TABLE]
For and , there exists such that
[TABLE]
By (3.20), (3.23), and (3.25), we know that there exists such that
[TABLE]
Since is arbitrary, we get
[TABLE]
[TABLE]
For and , we obtain by (3.19), (3.2), and the dominated convergence theorem that
[TABLE]
By (3.28), the right continuity of the function on , and the uniqueness of the Laplace transform, we find that
[TABLE]
which implies that
[TABLE]
By (3.2), (3.29), the fact that the function is decreasing on , and the continuity of the function on , we get
[TABLE]
which implies that
[TABLE]
Further, we obtain by the semigroup property that
[TABLE]
Since the subsequence of is arbitrary, we get
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M.A. Carlton (2002). A family of densities derived from the three-parameter Dirichlet process. J. Appl. Prob. 39 764-774.
- 2[2] C. Costantini, P. De Blasi, S.N. Ethier, M. Ruggiero, and D. Spanò (2017). Wright-Fisher construction of the two-parameter Poisson-Dirichlet diffusion. To appear in Ann. Appl. Probab.
- 3[3] S.N. Ethier (1990). The infinitely-many-neutral-alleles diffusion model with ages. Adv. Appl. Prob. 22 1-24.
- 4[4] S.N. Ethier and T.G. Kurtz (1981). The infinitely-many-neutral-alleles diffusion model. Adv. Appl. Prob. 13 429-452.
- 5[5] S.N. Ethier and T.G. Kurtz (1993). Fleming–Viot processes in population genetics. SIAM J. Control Optimization 31 345–386.
- 6[6] S. Feng and W. Sun (2010). Some diffusion processes associated with two parameter Poisson-Dirichlet distribution and Dirichlet process. Probab. Theory Relat. Fields 148 501–525.
- 7[7] S. Feng and F.Y. Wang (2007). A class of infinite-dimensional diffusion processes with connection to population genetics. J. Appl. Prob. 44 938-949.
- 8[8] T.S. Ferguson (1973). A Bayesian analysis of some nonparametric problems. Ann. Stat. 1 209-230.
