On quasi-infinitely divisible random measures

Riccardo Passeggeri

arXiv:1906.06736·math.PR·September 23, 2020

On quasi-infinitely divisible random measures

Riccardo Passeggeri

PDF

Open Access

TL;DR

This paper investigates quasi-infinitely divisible (QID) random measures, showing their density in the space of all completely random measures (CRMs), establishing a Lévy-Khintchine formulation, and exploring implications for Bayesian nonparametrics.

Contribution

It proves the density of QID CRMs in the space of all CRMs and establishes a Lévy-Khintchine representation with a one-to-one law correspondence.

Findings

01

QID CRMs are dense in all CRMs with respect to distribution convergence.

02

QID CRMs possess a Lévy-Khintchine type representation.

03

Results have implications for Bayesian nonparametric models.

Abstract

Quasi-infinitely divisible (QID) distributions have been recently introduced by Lindner, Pan and Sato (\textit{Trans.~Amer.~Math.~Soc.}~\textbf{370}, 8483-8520 (2018)). A random variable $X$ is QID if and only if there exist two infinitely divisible (ID) random variables $Y$ and $Z$ s.t.~ $X + Y = d Z$ and $Y$ is independent of $X$ . In this work, we show that a family of QID completely random measures (CRMs) is dense in the space of all CRMs with respect to convergence in distribution. We further demonstrate that the elements of this family posses a L\'{e}vy-Khintchine formulation and that there exists a one to one correspondence between their law and certain characteristic pairs. We prove the same results also for the class of point processes with independent increments. In the second part of the paper, we show the relevance of these results in the general Bayesian nonparametric…

Equations98

∣ μ ∣ (A) := sup j = 1 \sum \infty ∣ μ (A_{j}) ∣,

∣ μ ∣ (A) := sup j = 1 \sum \infty ∣ μ (A_{j}) ∣,

∣ ν ∣ ({0}) = ν^{+} ({0}) = ν^{-} ({0}) = 0

∣ ν ∣ ({0}) = ν^{+} ({0}) = ν^{-} ({0}) = 0

and ∣ ν ∣ (A) = ∣ ν_{∣ B_{r} (R)} ∣, ν^{+} (A) = ν_{∣ B_{r} (R)}^{+} (A), ν^{-} (A) = ν_{∣ B_{r} (R)}^{-} (A),

and ∣ ν ∣ (A) = ∣ ν_{∣ B_{r} (R)} ∣, ν^{+} (A) = ν_{∣ B_{r} (R)}^{+} (A), ν^{-} (A) = ν_{∣ B_{r} (R)}^{-} (A),

\overset{μ}{^} (θ) = exp (i θ γ - \frac{θ ^{2}}{2} a + \int_{R} (e^{i θ x} - 1 - i θ τ (x)) ν (d x))

\overset{μ}{^} (θ) = exp (i θ γ - \frac{θ ^{2}}{2} a + \int_{R} (e^{i θ x} - 1 - i θ τ (x)) ν (d x))

\int_{B} f d ν := \int_{B} f d ν^{+} - \int_{B} f d ν^{-}, B \in B (R) .

\int_{B} f d ν := \int_{B} f d ν^{+} - \int_{B} f d ν^{-}, B \in B (R) .

μ = \frac{δ _{γ} * exp ( ν )}{exp ( ν ( R ^{d} ))} .

μ = \frac{δ _{γ} * exp ( ν )}{exp ( ν ( R ^{d} ))} .

μ_{n} ({b_{j, n}}) = ⎩ ⎨ ⎧ μ ((- \infty, b_{0, n}]), μ ((b_{j - 1, n}, b_{j, n}]), μ ((b_{2 n^{2} - 1, n}, \infty)), j = 0, j = 1, ..., 2 n^{2} - 1, j = 2 n^{2} .

μ_{n} ({b_{j, n}}) = ⎩ ⎨ ⎧ μ ((- \infty, b_{0, n}]), μ ((b_{j - 1, n}, b_{j, n}]), μ ((b_{2 n^{2} - 1, n}, \infty)), j = 0, j = 1, ..., 2 n^{2} - 1, j = 2 n^{2} .

ρ (F, G) := in f {ε > 0 ∣ F (x - ε) - ε \leq G (x) \leq F (x + ε) + ε for all x \in R} .

ρ (F, G) := in f {ε > 0 ∣ F (x - ε) - ε \leq G (x) \leq F (x + ε) + ε for all x \in R} .

ρ (F_{c}, G_{c}) = in f {ε > 0 ∣ F_{c} (x - ε) - ε \leq G_{c} (x) \leq F_{c} (x + ε) + ε for all x \in R}

ρ (F_{c}, G_{c}) = in f {ε > 0 ∣ F_{c} (x - ε) - ε \leq G_{c} (x) \leq F_{c} (x + ε) + ε for all x \in R}

\leq in f {ε > 0 ∣ F (x - ε) - ε \leq G (x) \leq F (x + ε) + ε for all x \in R} = ρ (F, G) .

\leq in f {ε > 0 ∣ F (x - ε) - ε \leq G (x) \leq F (x + ε) + ε for all x \in R} = ρ (F, G) .

ρ (F_{1} * \dots * F_{k}, G_{1} * \dots * G_{k}) \leq j = 1 \sum k ρ (F_{j}, G_{j}) .

ρ (F_{1} * \dots * F_{k}, G_{1} * \dots * G_{k}) \leq j = 1 \sum k ρ (F_{j}, G_{j}) .

α = γ + \int_{0}^{\infty} \int_{S} x δ_{s} η (d s d x), a.s.

α = γ + \int_{0}^{\infty} \int_{S} x δ_{s} η (d s d x), a.s.

\int_{0}^{\infty} (1 \land x) F (A \times d x) < \infty,

\int_{0}^{\infty} (1 \land x) F (A \times d x) < \infty,

α (A) = γ (A) + \int_{0}^{\infty} x η (A \times d x) and α f = γ f + \int_{0}^{\infty} \int_{S} x f (s) η (d s d x), a.s..

α (A) = γ (A) + \int_{0}^{\infty} x η (A \times d x) and α f = γ f + \int_{0}^{\infty} \int_{S} x f (s) η (d s d x), a.s..

\int_{0}^{\infty} (1 \land x) F (S_{n} \times d x) < \infty \Rightarrow F (S_{n} \times (\frac{1}{n}, \infty)) < \infty,

\int_{0}^{\infty} (1 \land x) F (S_{n} \times d x) < \infty \Rightarrow F (S_{n} \times (\frac{1}{n}, \infty)) < \infty,

α_{n} = γ_{n} + \int_{0}^{\infty} \int_{S} x δ_{s} η_{n} (d s d x) .

α_{n} = γ_{n} + \int_{0}^{\infty} \int_{S} x δ_{s} η_{n} (d s d x) .

- lo g E [exp (- \int f (s) α (d s))] = γ f + \int_{0}^{\infty} \int_{S} 1 - e^{- x δ_{s} f} F (d s d x) .

- lo g E [exp (- \int f (s) α (d s))] = γ f + \int_{0}^{\infty} \int_{S} 1 - e^{- x δ_{s} f} F (d s d x) .

- lo g E [exp (- \int f (s) α (d s))] + lo g E [exp (- \int f (s) α_{n} (d s))]

- lo g E [exp (- \int f (s) α (d s))] + lo g E [exp (- \int f (s) α_{n} (d s))]

= \int_{S} f (s) γ (d s) + \int_{0}^{\infty} \int_{S} 1 - e^{- x f (s)} F (d s d x) - \int_{S_{n}} f (s) γ (d s) + \int_{\frac{1}{n}}^{\infty} \int_{S_{n}} 1 - e^{- x f (s)} F (d s d x)

= \int_{S} f (s) γ (d s) + \int_{0}^{\infty} \int_{S} 1 - e^{- x f (s)} F (d s d x) - \int_{S_{n}} f (s) γ (d s) + \int_{\frac{1}{n}}^{\infty} \int_{S_{n}} 1 - e^{- x f (s)} F (d s d x)

= \int_{S ∖ S_{n}} f (s) γ (d s) + \int_{0}^{\frac{1}{n}} \int_{S_{n}} 1 - e^{- x f (s)} F (d s d x) + \int_{0}^{\infty} \int_{S ∖ S_{n}} 1 - e^{- x f (s)} F (d s d x) \to 0,

= \int_{S ∖ S_{n}} f (s) γ (d s) + \int_{0}^{\frac{1}{n}} \int_{S_{n}} 1 - e^{- x f (s)} F (d s d x) + \int_{0}^{\infty} \int_{S ∖ S_{n}} 1 - e^{- x f (s)} F (d s d x) \to 0,

ξ = a . s . α + j = 1 \sum K β_{j} δ_{s_{j}}

ξ = a . s . α + j = 1 \sum K β_{j} δ_{s_{j}}

\mathcal{A}:=\bigg{\{}\xi\in\mathcal{I}\bigg{|}\xi\stackrel{{\scriptstyle a.s.}}{{=}}\alpha+\sum_{j=1}^{K}\beta_{j}\delta_{s_{j}},\textnormal{with $\alpha$ an atomless CRM with finite L\'{e}vy measure, $\{s_{j}:j=1,...,K\}$}

\mathcal{A}:=\bigg{\{}\xi\in\mathcal{I}\bigg{|}\xi\stackrel{{\scriptstyle a.s.}}{{=}}\alpha+\sum_{j=1}^{K}\beta_{j}\delta_{s_{j}},\textnormal{with $\alpha$ an atomless CRM with finite L\'{e}vy measure, $\{s_{j}:j=1,...,K\}$}

S

S

\textnormal{measure and zero Gaussian variance and that are mutually independent and independent of $\alpha$}\bigg{\}}.

\textnormal{measure and zero Gaussian variance and that are mutually independent and independent of $\alpha$}\bigg{\}}.

ξ = a . s . α + j = 1 \sum K β_{j} δ_{s_{j}}

ξ = a . s . α + j = 1 \sum K β_{j} δ_{s_{j}}

\mathbb{P}^{\prime}\Big{(}\alpha_{n}^{\prime}(B_{1})<x_{1},...,\alpha_{n}^{\prime}(B_{k})<x_{k},\delta_{s_{1}}(B_{1})\beta^{\prime}_{n,1}<x^{(1)}_{1},...,\delta_{s_{1}}(B_{k})\beta^{\prime}_{n,1}<x^{(1)}_{k},

\mathbb{P}^{\prime}\Big{(}\alpha_{n}^{\prime}(B_{1})<x_{1},...,\alpha_{n}^{\prime}(B_{k})<x_{k},\delta_{s_{1}}(B_{1})\beta^{\prime}_{n,1}<x^{(1)}_{1},...,\delta_{s_{1}}(B_{k})\beta^{\prime}_{n,1}<x^{(1)}_{k},

...,\delta_{s_{n}}(B_{1})\beta^{\prime}_{n,n}<x^{(n)}_{1},...,\delta_{s_{n}}(B_{k})\beta^{\prime}_{n,n}<x^{(n)}_{k}\Big{)}=\mathbb{P}\Big{(}\alpha(B_{1})<x_{1},...,\alpha(B_{k})<x_{k}\Big{)}

...,\delta_{s_{n}}(B_{1})\beta^{\prime}_{n,n}<x^{(n)}_{1},...,\delta_{s_{n}}(B_{k})\beta^{\prime}_{n,n}<x^{(n)}_{k}\Big{)}=\mathbb{P}\Big{(}\alpha(B_{1})<x_{1},...,\alpha(B_{k})<x_{k}\Big{)}

\mathbb{P}_{1}\Big{(}\delta_{s_{1}}(B_{1})\beta^{\prime}_{n,1}<x^{(1)}_{1},...,\delta_{s_{1}}(B_{k})\beta^{\prime}_{n,1}<x^{(1)}_{k}\Big{)}\cdots\mathbb{P}_{n}\Big{(}\delta_{s_{n}}(B_{1})\beta^{\prime}_{n,n}<x^{(n)}_{1},...,\delta_{s_{n}}(B_{k})\beta^{\prime}_{n,n}<x^{(n)}_{k}\Big{)}

\mathbb{P}_{1}\Big{(}\delta_{s_{1}}(B_{1})\beta^{\prime}_{n,1}<x^{(1)}_{1},...,\delta_{s_{1}}(B_{k})\beta^{\prime}_{n,1}<x^{(1)}_{k}\Big{)}\cdots\mathbb{P}_{n}\Big{(}\delta_{s_{n}}(B_{1})\beta^{\prime}_{n,n}<x^{(n)}_{1},...,\delta_{s_{n}}(B_{k})\beta^{\prime}_{n,n}<x^{(n)}_{k}\Big{)}

ξ_{n} (\cdot) (ω^{'}) := α_{n}^{'} (\cdot) (ω^{'}) + j = 1 \sum n β_{n, j}^{'} (ω^{'}) δ_{s_{j}} (\cdot), \forall ω^{'} \in Ω^{'},

ξ_{n} (\cdot) (ω^{'}) := α_{n}^{'} (\cdot) (ω^{'}) + j = 1 \sum n β_{n, j}^{'} (ω^{'}) δ_{s_{j}} (\cdot), \forall ω^{'} \in Ω^{'},

ρ (j = 1 \sum n f (s_{j}) β_{n, j}^{'}, j = 1 \sum \infty f (s_{j}) β_{j}) \leq ρ (j = 1 \sum n f (s_{j}) β_{n, j}^{'}, j = 1 \sum n f (s_{j}) β_{j}) + ρ (j = 1 \sum n f (s_{j}) β_{j}, j = 1 \sum \infty f (s_{j}) β_{j}) .

ρ (j = 1 \sum n f (s_{j}) β_{n, j}^{'}, j = 1 \sum \infty f (s_{j}) β_{j}) \leq ρ (j = 1 \sum n f (s_{j}) β_{n, j}^{'}, j = 1 \sum n f (s_{j}) β_{j}) + ρ (j = 1 \sum n f (s_{j}) β_{j}, j = 1 \sum \infty f (s_{j}) β_{j}) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference

Full text

On quasi-infinitely divisible random measures

Riccardo Passeggeri111LPSM, Sorbonne University. Email: [email protected]

Abstract

Quasi-infinitely divisible (QID) distributions have been recently introduced by Lindner, Pan and Sato (Trans. Amer. Math. Soc. 370, 8483-8520 (2018)). A random variable $X$ is QID if and only if there exist two infinitely divisible (ID) random variables $Y$ and $Z$ s.t. $X+Y\stackrel{{\scriptstyle d}}{{=}}Z$ and $Y$ is independent of $X$ . In this work, we show that a family of QID completely random measures (CRMs) is dense in the space of all CRMs with respect to convergence in distribution. We further demonstrate that the elements of this family posses a Lévy-Khintchine formulation and that there exists a one to one correspondence between their law and certain characteristic pairs. We prove the same results also for the class of point processes with independent increments. In the second part of the paper, we show the relevance of these results in the general Bayesian nonparametric framework based on CRMs developed by Broderick, Wilson and Jordan (Bernoulli, 24, 3181-3221 (2018)).

Keywords: quasi-infinite divisibility, completely random measure, dense class, nonparametric Bayesian analysis, automatic conjugacy.

MSC (2010): 60E07, 60G57, 60A10, 62F15

1 Introduction

A random measure $\xi$ on $S$ , with underlying probability space $(\Omega,\mathcal{F},\mathbb{P})$ , is a function $\Omega\times\textbf{S}\rightarrow[0,\infty]$ , such that $\xi(\omega,B)$ is a $\mathcal{F}$ -measurable in $\omega\in\Omega$ for fixed $B$ and a locally finite measure in $B\in\textbf{S}$ for fixed $\omega$ . Completely random measures (CRMs) have the additional property that for any disjoint $B_{1},B_{2},...,B_{k}\in\textbf{S}$ , $k\in\mathbb{N}$ , the random variables $\xi(B_{1}),\xi(B_{2}),...,\xi(B_{k})$ are independent. CRMs, also called independently scattered random measures or random measures with independent increments, have a fundamental role in nonparametric Bayesian analysis; as Ghosal and van der Vaart affirm in their recent book (see [7]) CRMs “arise as priors, or building blocks for priors, in many Bayesian nonparametric applications”.

Completely random measures have a long history which is inextricably linked with the one of infinitely divisible distributions. In 1967, Kingman [14] proved a very appealing and useful representation theorem for all CRMs. He showed that any CRM $\xi$ is almost surely given by the sum of three components: one deterministic, one concentrated on a fixed set of atoms, and one concentrated on a random set of atoms. He further showed that the last component, which he called the ordinary component, is fully determined by a Poisson point process: $\xi_{ord}(B)=\int_{(0,\infty)}x\eta(B\times dx)$ , where $\eta$ is a Poisson point process on $S\times(0,\infty)$ . The Poisson point process is the prime example of infinitely divisible CRM.

Infinitely divisible (ID) distributions have an even longer history that goes back to the work of Lévy, Kolmogorov and De Finetti, among others. They constitute one of the most studied classes of probability distributions. One of their most attractive properties is that their characteristic function have an explicit formulation, called the Lévy-Khintchine formulation, written in terms of three mathematical objects. These are the drift, which is a real valued constant, the Gaussian component, which is a non-negative constant, and the Lévy measure, which is a measure on $\mathbb{R}$ satisfying an integrability condition and with no mass at $\{0\}$ . Gaussian and Poisson distributions are examples of this class.

In 2018, in [15] Sato, Lindner and Pan introduced the class of quasi-infinitely divisible (QID) distributions. A QID random variable is defined as follows: a random variable $X$ is QID (namely has a QID distribution) if and only if there exist two ID random variables $Y$ and $Z$ s.t. $X+Y\stackrel{{\scriptstyle d}}{{=}}Z$ and $Y$ is independent of $X$ . QID distributions are like ID distributions except for the fact that the Lévy measure is now allowed to take negative values. In other words, a QID distribution has a Lévy-Khintchine formulation which is uniquely determined by a drift, a Gaussian component and by a ‘signed measure’ (more precisely a real valued set function) called the quasi-Lévy measure. Any ID distribution is QID, but the converse is not always true.

In [15], the authors show that QID distributions are dense in the space of all probability distributions with respect to weak convergence and that distributions concentrated on the integers (or any shift and dilation of them) are QID if and only if their characteristic functions have no zeros, among other results. Further theoretical results have been achieved in [1, 13, 19, 20]. In [19], the QID framework is extended to real-valued random noises and stochastic processes. QID distributions have already shown to have an impact in various fields: from mathematical physics, see [4] and [6], to number theory, see [16] and [17], and to insurance mathematics, see [21].

The first main contribution of this paper is the density result for QID random measures. We prove that a certain class of QID completely random measures (CRMs), which we denote by $\mathcal{A}$ , is dense with respect to convergence in distribution (precisely in both weak and vague convergence) in the space of all CRMs, also know as random measures with independent increments or as independently scattered random measures. This result extends the density result in [15] to the infinite dimensional setting of CRMs.

The class $\mathcal{A}$ have quite remarkable features. First, as any CRM they have an almost sure representation in terms of an ‘atomless’ ID component and an ‘atomic’ one. Second, the number of atoms is finite. Third, these random measures are almost surely finite and even more their atomless component has finite Lévy measure.

Moreover, for the elements of this class, we are able to show an explicit spectral representation, namely the Lévy-Khintchine formulation, and prove that there exists a unique one-to-one correspondence between them and pairs of deterministic measures satisfying certain conditions, which we call characteristic pairs. We prove all these results also for the class of point processes with independent increments, of which the Poisson point process is an example.

With these results this paper shows that the fixed component of a CRM, which has been left out in Kingman’s analysis and in the theory of CRM in general, have exactly the same nice representation results as the widely studied ordinary component. Thus, not only there is no real need of leaving out of the analysis the fixed component (as Kingman graphically says, fixed atoms can be removed by simple surgery), but this might also be dangerous since in many applications the fixed component has an irreplaceable role. This will also appear evident in the Bayesian setting we discuss in this work (see also [2]).

In the last section we investigate the impact of these results in the nonparametric Bayesian statistical framework presented by Broderick, Wilson and Jordan in [2] based on CRMs (see also [3]). In particular, we consider priors to be given by elements in $\mathcal{A}$ (with quasi-Lévy measure having a particular structure). We show that they are dense in the space of priors considered in [2] and [3] with respect to convergence in distribution, thus showing also that our density result is flexible enough to adjust to various assumptions/settings. Second, we present explicit formulations for their posterior distributions. Third, when focusing on point processes, we prove automatic conjugacy for all the elements of $\mathcal{A}$ under the only condition that the characteristic function of the posterior distribution has no zeros. This condition is satisfied in many situations and the result is more general than the the one of [2] which is based on the exponential structure of the likelihood.

We remark that the general nature of our results allow them to be applied in many Bayesian settings. Thus, the choice of the work of Broderick, Wilson and Jordan [2] represents a first easy example.

The paper is structured as follows. Section 2 concerns with the notations and some preliminaries. In Section 3 we provide the density results for CRMs and in Subsection 3.1 the one for point processes with independent increments. In Section 4, we show various properties for the classes of QID random measures and QID point processes presented in Section 3. In particular we present the Lévy-Khintchine formulation and the one-to-one correspondence of these random measures with their unique characteristic pair. In Section 5, we present the Bayesian setting and the relative results: computation of the posterior, convergence results for the posterior, and automatic conjugacy.

2 Notation and Preliminaries

By a measure on a measurable space $(X,\mathcal{G})$ we always mean a positive measure on $(X,\mathcal{G})$ , i.e. a $[0,\infty]$ -valued $\sigma$ -additive set function on $\mathcal{G}$ that assigns the value [math] to the empty set. For a non-empty set $X$ , by $\mathcal{B}(X)$ we mean the Borel $\sigma$ -algebra of $X$ , unless stated differently. The law and the characteristic function of a random variable $X$ will be denoted by $\mathcal{L}(X)$ and by $\hat{\mathcal{L}}(X)$ , respectively. For two measurable spaces $(X,\mathcal{G})$ and $(Y,\mathcal{F})$ , we denote by $\mathcal{G}\otimes\mathcal{F}$ the product $\sigma$ -algebra of $\mathcal{G}$ and $\mathcal{F}$ , and by $\mathcal{G}\times\mathcal{F}$ their Cartesian product. Let us recall some definitions.

Definition 2.1 (extended signed measure).

Given a measurable space $(X,\Sigma)$ , that is, a set $X$ with a $\sigma$ -algebra $\Sigma$ on it, an extended signed measure is a function $\ \mu:\Sigma\to{\mathbb{R}}\cup\{\infty,-\infty\}$ s.t. $\mu(\emptyset)=0$ and $\mu$ is $\sigma$ -additive, that is, it satisfies the equality $\mu\left(\bigcup_{{n=1}}^{\infty}A_{n}\right)=\sum_{{n=1}}^{\infty}\mu(A_{n})$ where the series on the right must converge in ${\mathbb{R}}\cup\{\infty,-\infty\}$ absolutely (namely the value of the series is independent of the order of its elements), for any sequence $A_{1},A_{2},...$ of disjoint sets in $\Sigma$ .

As a consequence any extended signed measure can take plus or minus infinity as value, but not both. In this work, we use the term ‘signed measure’ for an extended signed measure. Further, the total variation of a signed measure $\mu$ is defined as the measure $|\mu|:\Sigma\rightarrow[0,\infty]$ defined by

[TABLE]

where the supremum is taken over all the partitions $\{A_{j}\}$ of $A\in\Sigma$ . The total variation $|\mu|$ is finite if and only if $\mu$ is finite. Let us recall the definition of a signed bimeasure.

Definition 2.2 (Signed bimeasure).

*Let $(X,\Sigma)$ and $(Y,\Gamma)$ be two measurable spaces. A signed bimeasure is a function $M:\Sigma\times\Gamma\rightarrow[-\infty,\infty]$ such that:

(i) the function $A\rightarrow M(A,B)$ is a signed measure on $\Sigma$ for every $B\in\Gamma$ ,

(i) the function $B\rightarrow M(A,B)$ is a signed measure on $\Gamma$ for every $A\in\Sigma$ .*

Let $S$ be a separable and complete metric space with Borel $\sigma$ -algebra S and let $\hat{\mathbf{S}}$ be the ring composed by bounded Borel sets in $S$ . The triplet $(S,\textbf{S},\hat{\textbf{S}})$ is called localised Borel space (see page 19 in [12]).

Definition 2.3 (random measure).

A random measure $\xi$ on $S$ , with underlying probability space $(\Omega,\mathcal{F},\mathbb{P})$ , is a function $\Omega\times\textbf{S}\rightarrow[0,\infty]$ , such that $\xi(\omega,B)$ is a $\mathcal{F}$ -measurable in $\omega\in\Omega$ for fixed $B$ and a locally finite measure in $B\in\textbf{S}$ for fixed $\omega$ .

Definition 2.4 (completely random measure).

A completely random measure (CRM) $\xi$ is a random measure s.t. for any disjoint $B_{1},B_{2},...,B_{k}\in\textbf{S}$ , $k\in\mathbb{N}$ , the random variables $\xi(B_{1}),\xi(B_{2}),...,\xi(B_{k})$ are independent. CRMs are also called independently scattered random measure or random measure with independent increments.

Definition 2.5 (diffuse random measure).

Using the notation of the previous definition, we say that a random measure $\xi$ on $S$ is diffuse if $\xi(\omega,B)$ is a locally finite diffuse measure in $B\in\textbf{S}$ for fixed $\omega$ .

Remark 2.6.

Term finite for random measures stands for a.s. finite. Thus, for a finite random measure we mean an a.s. finite random measure.

For a random measure $\xi$ on a Polish space $X$ , $x\in X$ is a fixed atom of $\xi$ if and only if $\mathbb{P}(|\xi(\{x\})|>0)>0$ . Further, a random measure $\xi$ is called atomless if $\xi(\{x\})\stackrel{{\scriptstyle a.s.}}{{=}}0$ for every $x\in X$ . The atomless condition is for random measures what the continuity in probability is for continuous time stochastic processes. We remark that an atomless random measure is not necessarily a diffuse random measure (see Corollary 12.11 in [11]). For example, think of a Poisson point process with $\mathbb{E}[\xi(s)]\equiv 0$ , like the homogeneous Poisson point process, which has no fixed atoms but it is not diffuse.

Now, we introduce the concept of a quasi-Lévy type measure. We start with the following definition, which we recall from [15]:

Definition 2.7.

Let $\mathcal{B}_{r}(\mathbb{R}):=\{B\in\mathcal{B}(\mathbb{R})|B\cap(-r,r)=\emptyset\}$ for $r>0$ and $\mathcal{B}_{0}(\mathbb{R}):=\bigcup_{r>0}\mathcal{B}_{r}(\mathbb{R})$ be the class of all Borel sets that are bounded away from zero. Let $\nu:\mathcal{B}_{0}(\mathbb{R})\rightarrow\mathbb{R}$ be a function such that $\nu_{|\mathcal{B}_{r}(\mathbb{R})}$ is a finite signed measure for each $r>0$ and denote the total variation, positive and negative part of $\nu_{|\mathcal{B}_{r}(\mathbb{R})}$ by $|\nu_{|\mathcal{B}_{r}(\mathbb{R})}|$ , $\nu^{+}_{|\mathcal{B}_{r}(\mathbb{R})}$ and $\nu^{-}_{|\mathcal{B}_{r}(\mathbb{R})}$ respectively. Then the total variation $|\nu|$ , the positive part $\nu^{+}$ and the negative part $\nu^{-}$ of $\nu$ are defined to be the unique measures on $(\mathbb{R},\mathcal{B}(\mathbb{R}))$ satisfying

[TABLE]

for $A\in\mathcal{B}_{r}(\mathbb{R})$ , for some $r>0$ .

As mentioned in [15], $\nu$ is not a a signed measure because it is defined on $\mathcal{B}_{0}(\mathbb{R})$ , which is not a $\sigma$ -algebra. In the case it is possible to extend the definition of $\nu$ to $\mathcal{B}(\mathbb{R})$ such that $\nu$ is a signed measure then we will identify $\nu$ with its extension to $\mathcal{B}(\mathbb{R})$ and speak of $\nu$ as a signed measure. Moreover, the uniqueness of $|\nu|$ , $\nu^{+}$ and $\nu^{-}$ is ensured by an application of the Carathéodory’s extension theorem (see Lemma 2.14 in [19]). Further, notice that $\mathcal{B}_{0}(\mathbb{R})=\{B\in\mathcal{B}(\mathbb{R}):0\notin\overline{B}\}\neq\{B\in\mathcal{B}(\mathbb{R}):0\notin B\}$ (see Remark 2.6 in [19]).

Definition 2.8 (quasi-Lévy type measure, quasi-Lévy measure, QID distribution, from [15]).

A quasi-Lévy type measure is a function $\nu:\mathcal{B}_{0}(\mathbb{R})\rightarrow\mathbb{R}$ satisfying the condition in Definition 2.7 and such that its total variation $|\nu|$ satisfies $\int_{\mathbb{R}}(1\wedge x^{2})|\nu|(dx)<\infty$ . Let $\mu$ be a probability distribution on $\mathbb{R}$ . We say that $\mu$ is quasi-infinitely divisible if its characteristic function has a representation

[TABLE]

where $a,\gamma\in\mathbb{R}$ and $\nu$ is a quasi-Lévy type measure. The characteristic triplet $(\gamma,a,\nu)$ of $\mu$ is unique, and $a$ and $\gamma$ are called the Gaussian variance and the drift of $\mu$ , respectively. A quasi-Lévy type measure $\nu$ is called quasi-Lévy measure, if additionally there exist a quasi-infinitely divisible distribution $\mu$ and some $a,\gamma\in\mathbb{R}$ such that $(\gamma,a,\nu)$ is the characteristic triplet of $\mu$ . We call $\nu$ the quasi-Lévy measure of $\mu$ .

The above definition extend to the $\mathbb{R}^{d}$ case (for $d>1$ ) as shown in Remark 2.4 in [15]. As pointed out in Example 2.9 of [15], a quasi-Lévy measure is always a quasi-Lévy type measure, while the converse is not true. Moreover, we say that a function $f$ is integrable with respect to quasi-Lévy type measure $\nu$ if it is integrable with respect to $|\nu|$ . Then, we define:

[TABLE]

In this work we always keep the same order for the elements in the characteristic triplet: the first element is the drift, the second one is the Gaussian variance, and the third one is the (quasi) Lévy measure.

Definition 2.9 (QID random measure).

Let $\Lambda$ be a random measure. If $\Lambda(A)$ is a QID random variable, for every $A\in\textbf{S}$ , then we call $\Lambda$ a QID random measure.

We conclude with the following result on QID distributions.

Theorem 2.10 (Theorem 4.3.4 in [5]).

Let $d\in\mathbb{N}$ . The characteristic triplet $(\gamma,0,\nu)$ , where $\nu$ is a finite quasi-Lévy type measure, is the characteristic triplet of a QID distribution on $\mathbb{R}^{d}$ if and only if $\exp(\nu):=\sum_{n=1}^{\infty}\frac{\nu^{*n}}{n!}$ is a measure. In that case, $\mu\sim(\gamma,0,\nu)$ is given by

[TABLE]

3 The density result for QID CRMs

In this section we present the density results for QID CRMs in the space of all CRMs with respect to convergence in distribution. Let us start with some preliminaries. Let $S$ be a separable and complete metric space with Borel $\sigma$ -algebra S and let $\hat{\mathbf{S}}$ be the ring composed by bounded Borel sets in $S$ . Let $\hat{C}_{S}$ be the space of all bounded continuous functions $f:S\rightarrow\mathbb{R}_{+}$ with bounded support. Let $\mathcal{M}_{S}$ be the space of locally finite measures, namely $\mu\in\mathcal{M}_{S}$ if $\mu(B)<\infty$ for every $B\in\hat{\mathbf{S}}$ . The space $\mathcal{M}_{S}$ might be endowed with the vague topology, denoted by $\mathbf{B}_{\mathcal{M}_{S}}$ , generated by the integration maps $\pi_{f}:\mu\mapsto\int f(x)\mu(dx)$ , for all $f\in\hat{C}_{S}$ . The vague topology is the coarsest topology making all $\pi_{f}$ continuous. The measurable space $(\mathcal{M}_{s},\mathbf{B}_{\mathcal{M}_{S}})$ is a Polish space. The associated notion of vague convergence denoted by $\mu_{n}\stackrel{{\scriptstyle v}}{{\rightarrow}}\mu$ is defined by the condition $\int f(x)\mu_{n}(dx)\rightarrow\int f(x)\mu(dx)$ for all $f\in\hat{C}_{S}$ .

An equivalent definition of random measure (see Definition 2.3) is the following: a random measure $\xi$ is a measurable mapping from $(\Omega,\mathcal{F},\mathbb{P})$ to $(\mathcal{M}_{S},\mathcal{B}_{\mathcal{M}_{S}})$ , where $\mathcal{B}_{\mathcal{M}_{S}}$ is the topology generated by all projection maps $\pi_{B}:\mu\mapsto\mu(B)$ with $B\in\mathbf{S}$ , or, equivalently, by all integration maps $\pi_{f}$ with measurable $f\geq 0$ . From Lemma 4.1 in [10] or Theorem 4.2 in [12], we know that $\mathcal{B}_{\mathcal{M}_{S}}$ and $\mathbf{B}_{\mathcal{M}_{S}}$ coincide. Hence it is equivalent to consider a random measure as a measurable mapping from $(\Omega,\mathcal{F},\mathbb{P})$ to $(\mathcal{M}_{S},\textbf{B}_{\mathcal{M}_{S}})$ or to $(\mathcal{M}_{S},\mathcal{B}_{\mathcal{M}_{S}})$ .

The convergence in distribution of $\xi_{n}$ to $\xi$ means that $\mathbb{E}[g(\xi_{n})]\rightarrow\mathbb{E}[g(\xi)]$ for every bounded continuous function $g$ on $\mathcal{M}_{S}$ , or equivalently that $\mathcal{L}(\xi_{n})\stackrel{{\scriptstyle w}}{{\rightarrow}}\mathcal{L}(\xi)$ , where for any bounded measures $\mu_{n}$ and $\mu$ , the weak convergence $\mu_{n}\stackrel{{\scriptstyle w}}{{\rightarrow}}\mu$ stands for $\int g(y)\mu_{n}(dy)\rightarrow\int g(y)\mu(dy)$ for all $g$ as above. We write $\xi_{n}\stackrel{{\scriptstyle vd}}{{\rightarrow}}\xi$ to stress that the convergence of distribution is for random measures considered as random elements in the space $\mathcal{M}_{S}$ with vague topology. As mentioned in the previous section, in this setting an atom of a random measure $\xi$ is an element $s\in S$ such that $\mathbb{P}(\xi(\{s\})>0)>0$ .

We recall now a fundamental result by Harris, see [8].

Theorem 3.1 (see Theorem 4.11 in [12]).

*Let $\xi,\xi_{1},\xi_{2},...$ be random measures on $S$ . Then these conditions are equivalent:

(i) $\xi_{n}\stackrel{{\scriptstyle vd}}{{\rightarrow}}\xi$ ,

(ii) $\int f(x)\xi_{n}(dx)\stackrel{{\scriptstyle d}}{{\rightarrow}}\int f(x)\xi(dx)$ for all $f\in\hat{C}_{S}$ ,

(iiI) $\mathbb{E}[\exp(-\int f(x)\xi_{n}(dx))]\rightarrow\mathbb{E}[\exp(-\int f(x)\xi(dx))]$ for all $f\in\hat{C}_{S}$ with $f\leq 1$ .*

The following density result extends Theorem 4.1 in [15].

Theorem 3.2.

Let $A$ be a connected interval of the real line. The class of QID distributions with finite quasi-Lévy measure, zero Gaussian variance and with support on $A$ is dense in the class of probability distributions with support on $A$ with respect to weak convergence.

Proof.

Some arguments of the proof are in nature similar to the ones of the proof of Theorem 4.1 in [15]. First, we prove the result when $A$ is bounded.

Let $A$ be a finite closed interval, thus $A=[k,c]$ for some $k,c\in\mathbb{R}$ . Let $\mu$ be a probability distribution with support $[k,c]$ . For $n\in\mathbb{N}$ , let $b_{j,n}=k+(c-k)j/2n^{2}$ , $j\in\{0,...,2n^{2}\}$ and define the discrete distribution $\mu_{n}$ concentrated on the lattice $\{b_{0,n},...,b_{2n^{2},n}\}$ by

[TABLE]

Then, $\mu_{n}\stackrel{{\scriptstyle w}}{{\rightarrow}}\mu$ as $n\rightarrow\infty$ . Observe that $\mu_{n}$ is the probability distribution of a random variable with values on $\{b_{0,n},...,b_{2n^{2},n}\}\subset[k,c]$ . It remains to prove that each $\mu_{n}$ is a weak limit of QID distributions with finite quasi-Lévy measure, zero Gaussian variance and with support on $[k,c]$ . W.l.o.g. assume that the approximating sequence of distributions $\sigma$ is such that $\sigma(\{b_{j,n}\})>0$ for every $j\in\{0,...,2n^{2}\}$ . Assume that the characteristic function $\hat{\sigma}$ has zeros (in the other case we can directly use Corollary 3.10 in [15] to conclude). Let $X$ be a random variable with distribution $\sigma$ and define $Y=\frac{(X-k)2n^{2}}{c-k}$ . Then, $Y$ is concentrated on $\{0,...,2n^{2}\}$ with masses $a_{j}=\mathbb{P}(Y=j)>0$ for $j=0,...,2n^{2}$ , and its characteristic function has zeroes. Then, the polynomial $f(w)=\sum_{j=0}^{2n^{2}}a_{j}w^{j}$ has zeroes on the unit circle. Factorizing, we obtain $f(w)=a_{2n^{2}}\prod_{j=1}^{2n^{2}}(w-\xi_{j})$ , where $\xi_{j}$ , $j=1,...,2n^{2}$ , denote the complex roots. Let $f_{h}(w)=a_{2n^{2}}\prod_{j=1}^{2n^{2}}(w-\xi_{j}-h)$ , where $w\in\mathbb{C}$ and $h>0$ . Then, for small enough $h$ , $f_{h}$ is a polynomial with real coefficients, namely $f_{h}(w)=\sum_{j=0}^{2n^{2}}a_{h,j}w^{j}$ with $a_{h,j}\in\mathbb{R}$ . Observe that for small enough $h$ , $a_{h,j}$ and $a_{j}$ will be close, so $a_{h,j}>0$ . Now, let $Z_{h}$ be a random variable with distribution $\sigma_{h}=\left(\sum_{j=0}^{2n^{2}}a_{h,j}\right)^{-1}\sum_{j=0}^{2n^{2}}a_{h,j}\delta_{j}$ and let $X_{h}=\frac{Z_{h}(c-k)}{2n^{2}}+k$ . Observe that, for every $h>0$ , $X_{h}$ is random variable with values on the lattice $\{b_{0,n},...,b_{2n^{2},n}\}$ and its characteristic function has no zeros, and that $X_{h}\stackrel{{\scriptstyle d}}{{\rightarrow}}X$ as $h\searrow 0$ . Finally, by Corollary 3.10 in [15] we know that $X_{h}$ is QID with finite quasi-Lévy measure and zero Gaussian variance.

Observe that if $A$ is a bounded open interval, say $A=(k^{\prime},c^{\prime})$ for some $c,k\in\mathbb{R}$ , then the above arguments apply. Let $\mu$ be a probability distribution with support $(k^{\prime},c^{\prime})$ . For any $n\in\mathbb{N}$ let $k^{\prime}_{n}=k^{\prime}+\frac{(c^{\prime}-k^{\prime})}{2n^{2}}$ and $c^{\prime}_{n}=c^{\prime}-\frac{(c^{\prime}-k^{\prime})}{2n^{2}}$ and let $b_{j,n}=k^{\prime}_{n}+(c^{\prime}_{n}-k^{\prime}_{n})j/2n^{2}$ , $j\in\{0,...,2n^{2}\}$ and define the discrete distribution $\mu_{n}$ concentrated on the lattice $\{b_{0,n},...,b_{2n^{2},n}\}$ as in (2). Then, $\mu_{n}\stackrel{{\scriptstyle w}}{{\rightarrow}}\mu$ as $n\rightarrow\infty$ and, applying the same reaming arguments (in which $n$ is fixed) for $k^{\prime}_{n}$ and $c^{\prime}_{n}$ instead of $k$ and $c$ , we obtain the result for $A$ bounded and open.

Let now $A$ be an unbounded interval of the form $A=[k,\infty)$ for some $k\in\mathbb{R}$ . Let $\mu$ be a probability distribution with support on $[k,\infty)$ . For $n\in\mathbb{N}$ , let $b_{j,n}=k+j/n$ , $j\in\{0,...,2n^{2}\}$ and define the discrete distribution $\mu_{n}$ concentrated on the lattice $\{b_{0,n},...,b_{2n^{2},n}\}$ as in (2). Then, $\mu_{n}\stackrel{{\scriptstyle w}}{{\rightarrow}}\mu$ as $n\rightarrow\infty$ . Using the notation above, let $X$ be a random variable with distribution $\sigma$ and define $Y=(X-k)n$ . Then, $Y$ is concentrated on $\{0,...,2n^{2}\}$ with masses $a_{j}$ and its characteristic function has zeroes by assumption. We proceed as before. Thus, for small enough $h$ , we obtain a polynomial with real coefficients $f_{h}$ , namely $f_{h}(w)=\sum_{j=0}^{2n^{2}}a_{h,j}w^{j}$ with $a_{h,j}\in\mathbb{R}$ and $a_{h,j}>0$ , for small enough $h$ . Then, let $Z_{h}$ be a random variable with distribution $\sigma_{h}=\left(\sum_{j=0}^{2n^{2}}a_{h,j}\right)^{-1}\sum_{j=0}^{2n^{2}}a_{h,j}\delta_{j}$ and let $X_{h}=\frac{Z_{h}}{n}+k$ . Then, $X_{h}$ is random variables with support on $\{b_{0,n},...,b_{2n^{2},n}\}\subset[k,\infty)$ and its characteristic function has no zeros, and that $X_{h}\stackrel{{\scriptstyle d}}{{\rightarrow}}X$ as $h\searrow 0$ . Hence, by Corollary 3.10 in [15] we obtain the result.

Similarly we obtain the result for $(k^{\prime},\infty)$ , for $(-\infty,c]$ and for $(-\infty,c^{\prime})$ , where $k^{\prime},c,c^{\prime}\in\mathbb{R}$ . ∎

Recall that the Lévy-Prokhorov metric (or better just Lévy metric since we work on $\mathbb{R}$ ) for two probability distributions $F$ and $G$ on $\mathbb{R}$ is defined as

[TABLE]

Lemma 3.3.

Let $F$ and $G$ be any two probability distributions on $\mathbb{R}$ and let $F_{c}(x):=F(\frac{x}{c})$ and $G_{c}(x):=G(\frac{x}{c})$ where $c\in\mathbb{R}\setminus\{0\}$ . For every positive constant $c\leq 1$ we have that $\rho(F_{c},G_{c})\leq\rho(F,G)$ .

Proof.

Let $c$ be any positive constant $c\leq 1$ . Observe that $F_{c}(x-\varepsilon)=F(\frac{x-\varepsilon}{c})\leq F(\frac{x}{c}-\varepsilon)$ and similarly we have that $F_{c}(x+\varepsilon)\geq F(\frac{x}{c}+\varepsilon)$ . This implies that if $\varepsilon>0$ satisfies $F(x-\varepsilon)-\varepsilon\leq G(x)\leq F(x+\varepsilon)+\varepsilon\textnormal{ for all$ x\in\mathbb{R} $}$ , then it also satisfies $F_{c}(x-\varepsilon)-\varepsilon\leq G_{c}(x)\leq F_{c}(x+\varepsilon)+\varepsilon\textnormal{ for all$ x\in\mathbb{R} $}$ . Then, we have

[TABLE]

∎

Observe that for two real valued random variables $X$ and $Y$ the above lemma affirms that for any $0<c\leq 1$ we have that $\rho(cX,cY)\leq\rho(X,Y)$ .

Another useful property of the Prokhorov metric is the following. From condition 3) of the section “Lévy metric” in [22] (page 405) given any probability distributions on $\mathbb{R}$ $F_{1},...,F_{k},G_{1},...,G_{k}$ , where $k\in\mathbb{N}$ , we have that

[TABLE]

For the next two results denote by $S_{n}$ the sequence of bounded sets (i.e. $S_{n}\in\hat{\textbf{S}}$ ) s.t. $S_{n}\uparrow S$ . Notice that such sequence exists by the definition of $\hat{\textbf{S}}$ , see page 19 in [12].

Proposition 3.4.

Consider an atomless CRM $\alpha$ with corresponding unique pair $(\gamma,F)$ . Let $\gamma_{n}(A)=\gamma(S_{n}\cap A)$ and let $F_{n}(C)=F(C\cap(S_{n}\times(\frac{1}{n},\infty)))$ , for every $A\in\textbf{S}$ , $C\in\textbf{S}\otimes\mathcal{B}((0,\infty))$ and $n\in\mathbb{N}$ . Then, $\gamma_{n}$ and $F_{n}$ are finite measures and there exists a sequence of atomless finite CRMs $\alpha_{n}$ with pair $(\gamma_{n},F_{n})$ s.t. $\alpha_{n}\stackrel{{\scriptstyle d}}{{\to}}\alpha$ .

Proof.

From Kingman’s representation theorem (see [14] and see also Corollary 12.11 in [11] and Corollary 3.21 in [12]), we have that every atomless CRM $\alpha$ has the following representation:

[TABLE]

for some non-random diffuse measure $\gamma\in\mathcal{M}_{S}$ and a Poisson process $\eta$ on $S\times(0,\infty)$ with intensity $F$ satisfying

[TABLE]

for every $A\in\hat{\textbf{S}}$ . In particular, for every $B\in\textbf{S}$ we have that $\alpha(B)<\infty$ if and only if $\gamma(B)<\infty$ and condition $(\ref{Poisson})$ holds for $B\in\textbf{S}$ (see Corollary 12.11 in [11]). Further, notice that the above formulation implies that for every $A\in\textbf{S}$ and $f\in\hat{C}_{S}$

[TABLE]

Moreover, the unique one to one correspondence between $\alpha$ and $(\gamma,F)$ is shown in Theorem 3.20 of [12].

It is possible to see that $\gamma_{n}$ and $F_{n}$ are measures on S and on $\textbf{S}\otimes\mathcal{B}((0,\infty))$ , respectively. In particular, since $\alpha(A)<\infty$ for every $A\in\hat{\textbf{S}}$ then $\gamma(S_{n})<\infty$ and

[TABLE]

for every $n\in\mathbb{N}$ . Thus, $\gamma_{n}$ and $F_{n}$ are finite measures, for every $n\in\mathbb{N}$ .

Now, for every $n\in\mathbb{N}$ , let $\eta_{n}$ be a Poisson process on $S\times(0,\infty)$ with intensity $F_{n}$ and let

[TABLE]

Then, we have that $\alpha_{n}$ is an atomless CRM and since $\gamma_{n}$ and $F_{n}$ are finite then $\alpha_{n}$ is finite, for every $n\in\mathbb{N}$ (see Corollary 12.11 in [11]).

Concerning the stated convergence we have the following. From Lemma 12.2 in [11] (or from Lemma 3.1 in [12]) we have that for every $f\in\hat{C}_{S}$

[TABLE]

Hence, by assumption we have that for every $f\in\hat{C}_{S}$

[TABLE]

as $n\to\infty$ . Then, by point (iii) in Theorem 3.1 (see also Lemma 4.24 in [12]) we obtain that $\alpha_{n}\stackrel{{\scriptstyle d}}{{\to}}\alpha$ , as $n\to\infty$ . ∎

Now, let us denote by $\mathcal{I}$ the set of all CRMs on $S$ (considered as random elements in $\mathcal{M}_{S}$ endowed with the vague topology) and recall that $\mathbb{Z}_{+}=\mathbb{N}\cup\{0\}$ . From Theorem 7.1 in [10] we know that an element $\xi$ of $\mathcal{I}$ has the following representation

[TABLE]

with $K\in\mathbb{Z}_{+}\cup\{\infty\}$ , where $\{s_{j}:j\geq 1\}$ is the set of fixed atoms of $\xi$ in $S$ , $\alpha$ is an atomless CRM, and $\beta_{j}$ , $j\geq 1$ , are $\mathbb{R}_{+}$ -valued random variables, which are mutually independent and independent of $\alpha$ . We call $\sum_{j=1}^{K}\beta_{j}\delta_{s_{j}}$ the fixed component of $\xi$ . We remark that in the Kingman’s representation $\alpha$ is the sum of a deterministic and a ordinary component as shown in the proof of Proposition 3.4 in eq. (4).

Consider the following class of QID random measures:

[TABLE]

First, notice that $\alpha$ is ID, because any atomless random measure with independent increments is ID. Second, observe that, in contrast with the usual representation of CRMs, the elements of $\mathcal{A}$ have that the atomless random measure $\alpha$ has finite Lévy measure (thus, $\alpha$ is finite), that the number of fixed atoms $K$ is finite, and that $\beta_{j}$ , $j=1,...,K$ , $\mathbb{R}_{+}$ -valued QID random variables with finite quasi-Lévy measure and zero Gaussian variance. Notice that the elements of $\mathcal{A}$ are almost surely finite on S. Thus, $\mathcal{A}$ is strictly smaller than the class of QID CRMs, which in turn is strictly smaller than the class of all CRMs (namely $\mathcal{I}$ ).

We are ready to present the main result of this section.

Theorem 3.5.

$\mathcal{A}$ * is dense in the space of all CRMs with respect to the convergence in distribution.*

Proof.

From Theorem 7.1 in [10] we know that any CRM has the following unique representation

[TABLE]

with $K\leq\infty$ , where $\{s_{j}:j\geq 1\}$ is the set of fixed atoms of $\xi$ , $\alpha$ is a random measure without fixed atoms with independent increments (hence, $\alpha$ is an atomless ID random measure), and $\beta_{j}$ , $j\geq 1$ , are $\mathbb{R}_{+}$ -valued random variables, which are mutually independent and independent of $\alpha$ .

From Theorem 3.2 with $A=[0,\infty)$ , we know that for each $\beta_{j}$ there exists a sequence of non-negative QID random variable with zero Gaussian variance and finite Lévy measure that converges in distribution to $\beta_{j}$ , for every $j\in\mathbb{N}$ . Denote by $\beta_{n,j}$ such a sequence.

Denote by $S_{n}$ the sequence of bounded sets s.t. $S_{n}\uparrow S$ and by $(\gamma,F)$ be the pair associated to $\alpha$ . Let $\gamma_{n}(A)=\gamma(S_{n}\cap A)$ and $F_{n}(C)=F(C\cap(S_{n}\times(\frac{1}{n},\infty)))$ , for every $A\in\textbf{S}$ , $C\in\textbf{S}\otimes\mathcal{B}((0,\infty))$ and $n\in\mathbb{N}$ , as in Proposition 3.4. Then, by Proposition 3.4 there exists a sequence of finite CRMs $\alpha_{n}$ with pair $(\gamma_{n},F_{n})$ s.t. $\alpha_{n}\stackrel{{\scriptstyle d}}{{\to}}\alpha$ .

The first step is to show the existence of random measures $\xi_{n}\in\mathcal{A}$ with ID random measure equals in distribution to $\alpha_{n}$ , with fixed atoms in $\{s_{j}:j\geq 1\}$ and weights equal in distributions to $\beta_{n,j}$ . The existence is not immediate because we do not know whether the $\beta_{n,j}$ are mutually independent and independent of $\alpha_{n}$ in the underlying probability space of $\xi$ . This is a classical problem in probability and the solution lies in the construction of a probability space under which these conditions are satisfied, which is given by the ‘product’ of the probability spaces.

For the sake of clarity and completeness let us write here the arguments. Fix $n\in\mathbb{N}$ . Denote the underlying probability spaces of $\alpha_{n}$ by $(\Omega,\mathcal{F},\mathbb{P})$ and of the random variable $\beta_{n,j}$ by $(\Omega_{j},\mathcal{F}_{j},\mathbb{P}_{j})$ , for $j=1,...,n$ . Consider the probability space $(\Omega^{\prime},\mathcal{F}^{\prime},\mathbb{P}^{\prime})$ where $\Omega^{\prime}=\Omega\times\Omega_{1}\times\cdots\times\Omega_{n}$ , $\mathcal{F}^{\prime}=\mathcal{F}\otimes\mathcal{F}_{1}\otimes\cdots\otimes\mathcal{F}_{n}$ and $\mathbb{P}^{\prime}$ is the product probability measure of $\mathbb{P},\mathbb{P}_{1}$ ,…, $\mathbb{P}_{n}$ .

Let $\alpha_{n}^{\prime}(\cdot)(\omega,\omega_{1},...,\omega_{n}):=\alpha_{n}(\cdot)(\omega)$ and let $\beta^{\prime}_{n,j}(\omega,\omega_{1},...,\omega_{n}):=\beta_{n,j}(\omega_{j})$ , where $j=1,...,n$ , for every $(\omega,\omega_{1},...,\omega_{n})\in\Omega^{\prime}$ . Observe that for every $B_{1},...,B_{k}\in\textbf{S}$ and $x_{1},...,x_{k},x^{(1)}_{1},...,x^{(1)}_{k},...,x^{(n)}_{1},...,x^{(n)}_{k}\in\mathbb{R}_{+}$ we have that

[TABLE]

Now, let

[TABLE]

where $s_{1}$ ,…, $s_{n}$ are the same as the ones in (6). It is possible to see that, for every $\omega^{\prime}\in\Omega^{\prime}$ , $\xi_{n}(\cdot)(\omega^{\prime})$ is a measure because it is the sum of measures and that, for every $B\in\textbf{S}$ , $\xi_{n}(B)(\cdot)$ is a measurable function because it is the sum of measurable functions. Thus, $\xi_{n}$ is a random measure on $S$ and from its definition it is possible to see that it belongs to $\mathcal{A}$ .

Since $\beta_{n,j}\stackrel{{\scriptstyle d}}{{\rightarrow}}\beta_{j}$ we can choose a subsequence of $\beta_{n,j}$ , which by abuse of notation we denote it by $\beta_{n,j}$ , such that $\rho(\beta_{n,j},\beta_{j})<\frac{1}{n^{2}}$ for every $j=1,...,n$ and $n\in\mathbb{N}$ . From the above arguments there exists a sequence of random measures in $\mathcal{A}$ (with possibly different underlying probability spaces) such that $\xi_{n}=\alpha_{n}^{\prime}+\sum_{j=1}^{n}\beta^{\prime}_{n,j}\delta_{s_{j}}$ . Thus, using that $\beta^{\prime}_{n,j}\stackrel{{\scriptstyle d}}{{=}}\beta_{n,j}$ we obtain that $\rho(\beta^{\prime}_{n,j},\beta_{j})<\frac{1}{n^{2}}$ for every $j=1,...,n$ and $n\in\mathbb{N}$ .

Now, we need to show that $\xi_{n}\stackrel{{\scriptstyle vd}}{{\rightarrow}}\xi$ . From Theorem 3.1, it is sufficient to show that $\int f(x)\xi_{n}(dx)\stackrel{{\scriptstyle d}}{{\rightarrow}}\int f(x)\xi(dx)$ for all $f\in\hat{C}_{S}$ . Since $\alpha^{\prime}_{n}\stackrel{{\scriptstyle d}}{{=}}\alpha_{n}$ for every $n\in\mathbb{N}$ and $\alpha_{n}\stackrel{{\scriptstyle d}}{{\to}}\alpha$ for every $\omega\in\Omega$ then $\alpha^{\prime}_{n}\stackrel{{\scriptstyle d}}{{\to}}\alpha$ . Further, since $\alpha^{\prime}_{n}$ and $\alpha$ are independent of the corresponding fixed component, this reduces the goal to prove that $\sum_{j=1}^{n}f(s_{j})\beta_{n,j}^{\prime}\stackrel{{\scriptstyle d}}{{\rightarrow}}\sum_{j=1}^{\infty}f(s_{j})\beta_{j}$ for all $f\in\hat{C}_{S}$ .

Let $f\in\hat{C}_{S}$ , hence, $f$ is bounded and has bounded support, and by denoting $B$ the support of $f$ we have that $B\in\hat{\textbf{S}}$ and so that almost surely $\xi_{n}(B)<\infty$ , $n\in\mathbb{N}$ , and $\xi(B)<\infty$ . Thus, for each $n\in\mathbb{N}$ , $\sum_{j=1}^{n}f(s_{j})\beta_{n,j}^{\prime}<\infty$ a.s. and $\sum_{j=1}^{\infty}f(s_{j})\beta_{j}<\infty$ a.s..

Moreover, notice that it is sufficient to prove the result for any $f\in\hat{C}_{S}$ with $f(s)\leq 1$ for every $s\in S$ . Indeed, consider any $f\in\hat{C}_{S}$ and let $\bar{C}\in\mathbb{R}_{+}$ be its bound, then $\sum_{j=1}^{n}f(s_{j})\beta_{n,j}^{\prime}=\bar{C}\sum_{j=1}^{n}\frac{f(s_{j})}{\bar{C}}\beta_{n,j}^{\prime}$ and so if $\sum_{j=1}^{n}\frac{f(s_{j})}{\bar{C}}\beta_{n,j}^{\prime}\stackrel{{\scriptstyle d}}{{\rightarrow}}\sum_{j=1}^{\infty}\frac{f(s_{j})}{\bar{C}}\beta_{j}$ then $\sum_{j=1}^{n}f(s_{j})\beta_{n,j}^{\prime}\stackrel{{\scriptstyle d}}{{\rightarrow}}\sum_{j=1}^{\infty}f(s_{j})\beta_{j}$ .

Now, consider any $f\in\hat{C}_{S}$ with $f(s)\leq 1$ for every $s\in S$ . By the triangular inequality we have that

[TABLE]

The last element converges to zero as $n\rightarrow\infty$ because $\sum_{j=1}^{n}f(s_{j})\beta_{j}\stackrel{{\scriptstyle a.s.}}{{\rightarrow}}\sum_{j=1}^{\infty}f(s_{j})\beta_{j}$ as $n\rightarrow\infty$ . For the other element, by (3) and by Lemma 3.3 we obtain that

[TABLE]

Thus, we have that $\sum_{j=1}^{n}f(s_{j})\beta_{n,j}^{\prime}\stackrel{{\scriptstyle d}}{{\rightarrow}}\sum_{j=1}^{\infty}f(s_{j})\beta_{j}$ as $n\rightarrow\infty$ , which concludes the proof. ∎

Remark 3.6.

We could alternatively consider an almost sure equality in (7) and then use the existence and uniqueness results for random measures (see Theorem 2.15 and Corollary 2.16 in [11]) to obtain a random measure almost surely equal to $\xi_{n}$ . In addition, by the Kolmogorov extension theorem the same arguments of the first part of the above proof hold for the case of $n$ ‘equal’ to infinity, namely $\xi_{n}=\alpha_{n}^{\prime}+\sum_{j=1}^{\infty}\beta^{\prime}_{n,j}$ .

Further, we point out that if $\xi$ is such that the number of fixed atoms in any bounded set (i.e. in any $B\in\hat{\textbf{S}}$ ) is finite then the number of fixed atoms in the support of every $f\in\hat{C}_{S}$ is finite, namely $\{s_{j}:j\geq 1\}\cap\textnormal{supp}(f)$ has finite cardinality, and so the stated result follows directly from the mutual independence of the $\beta_{n,j}^{\prime}$ , $j=1,...,n$ , from the fact that $\beta_{n,j}^{\prime}\stackrel{{\scriptstyle d}}{{\rightarrow}}\beta_{j}$ as $n\rightarrow\infty$ , for every $j=1,...,n$ and $n\in\mathbb{N}$ , and from the continuous mapping theorem.

Remark 3.7.

Let $\mathcal{A}_{\infty}$ be a class of random measures like $\mathcal{A}$ , but such that the ID component is not necessarily finite, i.e. the ‘ $\alpha$ ’ is not necessarily finite. Then, trivially $\mathcal{A}_{\infty}$ is dense in $\mathcal{I}$ w.r.t. the convergence in distribution. Indeed, let $\xi=\alpha+\sum_{j=1}^{K}\beta_{j}\delta_{s_{j}}$ be any CRM on $S$ . If we know the ID component of $\xi$ , i.e. $\alpha$ , and for modelling/theoretical reasons we can take an approximating sequence of unbounded $\xi_{n}$ , then we can define the $\xi_{n}$ s.t. $\xi_{n}(\cdot)(\omega^{\prime}):=\tilde{\alpha}_{n}^{\prime}(\cdot)(\omega^{\prime})+\sum_{j=1}^{n}\beta^{\prime}_{n,j}(\omega^{\prime})\delta_{s_{j}}(\cdot),$ $\forall\omega^{\prime}\in\Omega^{\prime}$ , where $\tilde{\alpha}_{n}^{\prime}(\cdot)(\omega,\omega_{1},...,\omega_{n}):=\alpha(\cdot)(\omega)$ . Then, $\xi_{n}\in\mathcal{A}_{\infty}$ and from the arguments of the proof of Theorem 3.5 it is possible to see that $\xi_{n}\stackrel{{\scriptstyle d}}{{\to}}\xi$ .

It is possible to consider also the set of bounded measures, denoted by $\hat{\mathcal{M}}_{S}$ , which can be endowed with the vague topology, as for $\mathcal{M}_{S}$ , but also with the weak topology. The weak topology on $\hat{\mathcal{M}}_{S}$ is the topology generated by the integration maps $\pi_{f}$ for all bounded continuous functions. Then, for random measures $\xi,\xi_{1},\xi_{2},...$ considered as random elements in $\hat{\mathcal{M}}_{S}$ , endowed with the weak topology, we will denote by $\xi_{n}\stackrel{{\scriptstyle wd}}{{\rightarrow}}\xi$ the convergence in distribution. Observe that in this setting a QID random measures as defined in Definition are QID random measures on $(S,\textbf{S})$ (hence we do not need to extend them) because for every $B\in\textbf{S}$ they are all a.s. bounded.

We will use the following result of Kallenberg to prove our next result.

Theorem 3.8 (see Theorem 4.19 in [12]).

*Let $\xi,\xi_{1},\xi_{2},...$ be a.s. bounded random measures on $S$ . Then these conditions are equivalent

(i) $\xi_{n}\stackrel{{\scriptstyle wd}}{{\rightarrow}}\xi$ ,

(ii) $\xi_{n}\stackrel{{\scriptstyle vd}}{{\rightarrow}}\xi$ , and $\xi_{n}(S)\stackrel{{\scriptstyle d}}{{\rightarrow}}\xi(S)$ .*

We are now ready to present our next result, which is similar to Theorem 3.5, but applies to $\hat{\mathcal{M}}_{S}$ and involves both the vague and the weak topology.

Theorem 3.9.

$\mathcal{A}$ * is dense in the space of all CRMs, considered as random elements in $\hat{\mathcal{M}}_{S}$ , endowed with either the vague topology or the weak topology, with respect to the convergence in distribution.*

Proof.

Consider first the case of $\hat{\mathcal{M}}_{S}$ endowed with the vague topology. Then, by the same arguments as the ones used in the proof of Theorem 3.5 we obtain the result.

For the weak topology case, by the same arguments as the ones used in the proof of Theorem 3.5 we have that $\xi_{n}\stackrel{{\scriptstyle vd}}{{\rightarrow}}\xi$ . Hence, according to Theorem 3.8 it remains to prove that $\xi_{n}(S)\stackrel{{\scriptstyle d}}{{\rightarrow}}\xi(S)$ , namely that $\alpha_{n}^{\prime}(S)+\sum_{j=1}^{n}\beta^{\prime}_{n,j}\stackrel{{\scriptstyle d}}{{\rightarrow}}\alpha(S)+\sum_{j=1}^{\infty}\beta_{j}$ . However, this has been proved in the proof of Theorem 3.5 – indeed, consider $f\equiv 1$ and notice that $\xi_{n}(S)$ and $\xi(S)$ are a.s. finite since $\xi_{n}$ and $\xi$ are almost surely bounded. Thus, the proof is complete. ∎

3.1 The density result for QID point processes

In this subsection we answer positively the following question: given any point process with independent increments is it possible to find a sequence of QID point processes with independent increments which converges in distribution to it?

Thus, in this subsection we restrict our focus to point processes with independent increments and check that the density result holds. There are two main reasons for doing this. First, the class of point processes with independent increments represents one of the most studied class of completely random measures due to their nice theoretical properties and their importance in applications. Second, we have an explicit formulation for the quasi-Lévy measure and the drift of QID random variables supported on finite subsets of $\mathbb{Z}_{+}$ (see Theorem 3.9 in [15]).

Let us first show the density result for random variables supported on $\mathbb{Z}_{+}$ .

Proposition 3.10.

The class of QID distributions supported on finite subsets of $\mathbb{Z}_{+}$ is dense in the class of probability distributions with support on $\mathbb{Z}_{+}$ with respect to weak convergence.

Proof.

Let $\mu$ be a probability distribution with support on $\mathbb{N}$ . For $n\in\mathbb{N}$ , define the discrete distribution $\mu_{n}$ concentrated on the lattice $\{0,...,2n\}$ by

[TABLE]

Then, $\mu_{n}\stackrel{{\scriptstyle w}}{{\rightarrow}}\mu$ as $n\rightarrow\infty$ . It remains to prove that each $\mu_{n}$ is a weak limit of QID distributions with support on $\{0,...,2n\}$ . W.l.o.g. assume that the approximating sequence of distributions $\sigma$ is such that $\sigma(\{j\})>0$ for every $j\in\{0,...,2n\}$ . Assume that the characteristic function $\hat{\sigma}$ has zeros (in the other case we can directly use Theorem 3.9 in [15] to conclude). Let $X$ be a random variable with distribution $\sigma$ and let $a_{j}=\mathbb{P}(X=j)>0$ for $j=0,...,2n$ . Then, the polynomial $f(w)=\sum_{j=0}^{2n}a_{j}w^{j}$ has zeroes on the unit circle. Factorizing, we obtain $f(w)=a_{2n}\prod_{j=1}^{2n}(w-\xi_{j})$ , where $\xi_{j}$ , $j=1,...,2n$ , denote the complex roots. Let $f_{h}(w)=a_{2n}\prod_{j=1}^{2n}(w-\xi_{j}-h)$ , where $w\in\mathbb{C}$ and $h>0$ . Then, for small enough $h$ , $f_{h}$ is a polynomial with real coefficients, namely $f_{h}(w)=\sum_{j=0}^{2n}a_{h,j}w^{j}$ with $a_{h,j}\in\mathbb{R}$ . Observe that for small enough $h$ , $a_{h,j}$ and $a_{j}$ will be close, so $a_{h,j}>0$ . Now, let $X_{h}$ be a random variable with distribution $\sigma_{h}=\left(\sum_{j=0}^{2n}a_{h,j}\right)^{-1}\sum_{j=0}^{2n}a_{h,j}\delta_{j}$ . We conclude by noticing that, for every $h>0$ , $X_{h}$ is random variable with values on the lattice $\{0,...,2n\}$ and its characteristic function has no zeros (thus it is QID by Theorem 3.9 in [15]), and that $X_{h}\stackrel{{\scriptstyle d}}{{\rightarrow}}X$ as $h\searrow 0$ . ∎

From Corollary 3.21 in [12], for an atomless point process with independent increments the corresponding unique pair, which we denote by $(\gamma,F)$ , is such that $\gamma=0$ and $F$ is restricted to $S\times\mathbb{N}$ .

Let $\mathcal{A}^{\prime}$ be the set of all the point processes in $\mathcal{A}$ . In other words, let

[TABLE]

Obviously, we have $\mathcal{A}^{\prime}\subsetneq\mathcal{A}\subsetneq\mathcal{I}$ .

Theorem 3.11.

$\mathcal{A}^{\prime}$ * is dense in the space of all point processes with independent increments with respect to the convergence in distribution.*

Proof.

It follows from the same arguments as the ones used in the proof of Theorem 3.5. In particular, now we need to use Proposition 3.10 instead of Theorem 3.2. Further, now $\gamma_{n}=\gamma=0$ and $F_{n}$ and $F$ are concentrated on $S\times\mathbb{N}$ . Then, following the same arguments as the ones used in the proof of Theorem 3.5 we obtain the result. ∎

We conclude this subsection with the density result for finite point processes (for which the weak topology might also be used), namely the equivalent of Theorem 3.9 for point processes with independent increments.

Proposition 3.12.

$\mathcal{A}^{\prime}$ * is dense in the space of point processes with independent increments, considered as random elements in $\hat{\mathcal{M}}_{S}$ , endowed with either the vague topology or with the weak topology, with respect to the convergence in distribution.*

Proof.

It follows from the same arguments as the ones used in the proof of Theorem 3.9, with Theorem 3.11 instead of Theorem 3.5. ∎

4 Properties of the dense class $\mathcal{A}$

In this section we explore some of the properties of the random measures in $\mathcal{A}$ , with a particular focus on spectral representations.

Consider the same notation as in the previous section. Let $\alpha$ be an atomless CRM (hence, ID). Using Theorem 12.10 and Corollary 12.11 in [11] we have that

[TABLE]

for every $\theta\in\mathbb{R}$ and $A\in\textbf{S}$ , where $\gamma$ is a finite diffuse measure on S and $F$ is a finite measure on $\textbf{S}\otimes\mathcal{B}((0,\infty))$ with diffuse projections onto $S$ . Observe that we can extend $F^{(1)}$ to a finite measure on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ , by assigning value zero outside $\textbf{S}\otimes\mathcal{B}((0,\infty))$ ; by abuse of notation, we call this finite measure $F^{(1)}$ .

Further, let $\sum_{j=1}^{n}\delta_{s_{j}}\beta_{j}$ , where $n\in\mathbb{N}$ , $s_{j}\in S$ , $j=1,...,n$ , and where the $\beta_{j}$ ’s are mutually independent QID random variables with finite quasi-Lévy measure and zero Gaussian variance. With centering function equal zero (as in (8)), denote by $c_{j}$ and $b_{j}$ the drift and the quasi-Lévy measure of $\beta_{j}$ , for $j=1,...,n$ . Notice that we can use such centering function because the $\beta_{j}$ ’s have finite quasi Lévy measure. Then, the Lévy-Khintchine formulation of $\sum_{j=1}^{n}\delta_{s_{j}}\beta_{j}$ is given by

[TABLE]

for every $\theta\in\mathbb{R}$ and $A\in\textbf{S}$ , where $\gamma^{(2)}(A)=\sum_{j=1}^{n}\delta_{s_{j}}(A)c_{j}$ and $F_{A}^{(2)}(\cdot)=\sum_{j=1}^{n}\delta_{s_{j}}(A)b_{j}(\cdot)$ . Then, $\xi=\alpha+\sum_{j=1}^{n}\delta_{s_{j}}\beta_{j}$ has the following formulation

[TABLE]

for every $\theta\in\mathbb{R}$ and $A\in\textbf{S}$ , where $\nu_{0}(A)=\gamma^{(2)}(A)-\gamma^{(1)}(A)$ and $F_{A}(\cdot)=F_{A}^{(1)}(\cdot)+F_{A}^{(2)}(\cdot)$ .

Proposition 4.1.

Let $\xi\in\mathcal{A}$ and adopt the notation above. Then, $F$ extends uniquely to a finite signed measure on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ .

Proof.

Consider the notations above. For the first statement we need to show that $F$ is a finite signed measure on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ . Since $F^{(1)}$ is a finite measure on $\textbf{S}\otimes\mathcal{B}((0,\infty))$ , it remains to show that $F^{(2)}$ is a finite signed measure on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ . We know that $F_{A}^{(2)}(\cdot)=\sum_{j=1}^{n}\delta_{s_{j}}(A)b_{j}(\cdot)$ where $b_{j}(\cdot)$ are finite signed measures on $\mathcal{B}(\mathbb{R})$ . It is possible to see that $F^{(2)}$ is a bimeasure on $\textbf{S}\times\mathcal{B}(\mathbb{R})$ and that

[TABLE]

where the supremum is taken over all the finite families of disjoints elements of $\textbf{S}\times\mathcal{B}(\mathbb{R})$ . Then, by Theorem 5.18 in [19] (see also Theorem 4 in [9]) $F^{(2)}$ extends to a finite signed measure on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ . Thus, $F$ is a finite signed measure on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ . ∎

Following the notation of the ID case (see [12] page 89), we call $F$ the quasi-Lévy measure of $\xi$ . Observe that in the ID case the Lévy measure might not even be $\sigma$ -finite (see [14] pages 82-83), while here our quasi-Lévy measure is a finite signed measure. Further, we remark that a similar result to Proposition 4.1 holds for $\xi\in\mathcal{A}_{\infty}$ (see Remark 3.7). In this case, $F^{(1)}$ is a measure (not necessarily $\sigma$ -finite) on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ (see Corollary 3.21 in [12]), and $F^{(2)}$ is the same as in the proof of Proposition 4.1. Thus, in this case $F=F^{(1)}+F^{(2)}$ is a signed measure not necessarily finite.

In the following result we show the existence of a unique correspondence between any element in $\mathcal{A}$ and a characteristic pair.

Theorem 4.2.

*Let $\xi\in\mathcal{A}$ . Then, there exists a pair $(\nu_{0},F)$ s.t. (9) holds, where $\nu_{0}$ and $F$ are a finite signed measure on S and $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ , respectively, s.t. for every $A\in\textbf{S}$ and $B\in\mathcal{B}(\mathbb{R})$ :

(i) $\nu_{0}(A)=-\gamma(A)+\sum_{j=1}^{n}\delta_{s_{j}}(A)c_{j}$ , for some diffuse finite measure $\gamma$ on S, $c_{1},...,c_{n}\in\mathbb{R}$ , and finitely many atoms $s_{1},...,s_{n}\in S$ ,

(ii) $F(A\times B)=\tilde{G}(A\times B)+\sum_{j=1}^{n}\delta_{s_{j}}(A)b_{j}(B)$ , for some finite measure $\tilde{G}$ on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ , which is the extension by zero of some measure $G$ on $\textbf{S}\otimes\mathcal{B}((0,\infty))$ with diffuse projections onto $S$ , and for some finite signed measures $b_{j}$ ’s on $\mathcal{B}(\mathbb{R})$ , such that $\exp(b_{1}),...,\exp(b_{n})$ are measures.*

Conversely, for every such pair $(\nu_{0},F)$ there exists a unique random measure $\xi\in\mathcal{A}$ s.t. (9) holds.

Proof.

Concerning the atomless component of $\xi$ , from Corollary 12.11 in [11] and Theorem 3.20 in [12] we know that there exists a one to one correspondence between an ID atomless random measure with independent increments and a characteristic pair, composed by a diffuse measure on S and a measure on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ with diffuse projections onto $S$ . In our case we note that the components of the characteristic pair are finite measures by definition.

For the fixed component of $\xi$ , by Theorem 2.10 we know that a characteristic triplet where the Gaussian component is zero and the quasi-Lévy measure is finite is the characteristic triplet of a QID random variable if and only if the exponential of the finite quasi-Lévy measure is a measure.

Then, by the definition of $\xi$ and by the discussion and the computations at the beginning of this section on the characteristic functions of the components of $\xi$ , we immediately obtain the result.

Notice that for the converse direction we need also to show the independence of the fixed and atomless components, but this follows immediately from the linear structure of $\nu_{0}$ and $F$ . ∎

Remark 4.3.

Notation: instead of using the characteristic pair we could have equivalently used the characteristic set $(\{s_{j}\}^{n}_{j=1},\gamma,\{c_{j}\}^{n}_{j=1},G,\{b_{j}\}^{n}_{j=1})$ , with the above structure, in order to have a one to one identification with $\xi\in\mathcal{A}$ .

4.1 Properties of the dense class $\mathcal{A}^{\prime}$

Since $\mathcal{A}^{\prime}\subsetneq\mathcal{A}$ all the results presented in the previous section holds for $\xi\in\mathcal{A}^{\prime}$ . In this subsection, we show that even better results holds for the elements in $\mathcal{A}^{\prime}$ . This is mainly due to the fact that we have more information about the structure of these random measures.

Let us recall Theorem 3.9 in [15]. Despite we have used this theorem before we present it here to facilitate the reader in the understanding of the results of this subsection.

Theorem 4.4 (Theorem 3.9 in [15]).

*Let $\mu$ be a discrete distribution concentrated on $\{0,1,2,...,n\}$ for some $n\in\mathbb{N}$ , i.e., $\mu=\sum_{j=0}^{n}a_{j}\delta_{j}$ , where $a_{0},...,a_{n-1}\geq 0,$ $a_{n}>0$ , and $a_{0}+\cdots+a_{n}=1$ . Then the following are equivalent:

(i) $\mu$ is quasi-infinitely divisible.

(ii) The characteristic function of $\mu$ has no zeroes.

(iii) The polynomial $w\mapsto\sum_{j=0}^{n}a_{j}w^{j}$ in the complex variable $w$ has no roots on the unit circle, i.e. $\sum_{j=0}^{n}a_{j}w^{j}\neq 0$ , for all $w\in\mathbb{C}$ with $|w|=1$ .*

Further, if one of the equivalent conditions (i)-(iii) holds, then the quasi-Lévy measure of $\mu$ is finite and concentrated on $\mathbb{Z}$ , the drift lies in $\{0,1...,n\}$ , and the Gaussian variance of $\mu$ is 0. More precisely, if $\xi_{1},...,\xi_{n}$ denote the $n$ complex roots of $w\mapsto\sum_{j=0}^{n}a_{j}w^{j}$ , counted with multiplicity, then the quasi-Lévy measure of $\mu$ is given by

[TABLE]

and the drift is equal to the number of those zeroes of this polynomial which lie inside the unit circle (counted with multiplicity), i.e., have modulus less than 1.

In the following theorem we adopt the following notation. Let $\xi\in\mathcal{A}^{\prime}$ . We denote by $\alpha$ its atomless component and by $\beta_{j}$ , $j=1,...,n$ the QID random variables of its fixed component, i.e. $\xi\stackrel{{\scriptstyle a.s.}}{{=}}\alpha+\sum_{j=1}^{n}\beta_{j}\delta_{s_{j}}$ . Further, for every $j=1,...,n$ , we denote the law of $\beta_{j}$ by $\sum_{l=0}^{k_{j}}a_{j,l}\delta_{l}$ , namely $\mathcal{L}(\beta_{j})=\sum_{l=0}^{k_{j}}a_{j,l}\delta_{l}$ and denote by $\zeta_{j,1},...,\zeta_{j,k_{j}}$ the $k_{j}$ complex roots of $w\mapsto\sum_{l=0}^{k_{j}}a_{j,l}w^{l}$ . Finally, we denote by $b_{j}$ the quasi-Lévy measure of $\beta_{j}$ , i.e.

[TABLE]

and by $c_{j}$ its drift, i.e. $c_{j}=\#\{|\zeta_{j,l}|<1,l=1,...,k_{j}\}$ .

Theorem 4.5.

*Let $\xi\in\mathcal{A}^{\prime}$ . Then, there exists a pair $(\nu_{0},F)$ s.t. (9) holds, where $\nu_{0}$ and $F$ are a finite signed measure on S and $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ , respectively, s.t. for every $A\in\textbf{S}$ and $B\in\mathcal{B}(\mathbb{R})$ :

(i) $\nu_{0}(A)=\sum_{j=1}^{n}\delta_{s_{j}}(A)c_{j}$ , where $n\in\mathbb{N}$ , $s_{j}\in S$ is an atom, and $c_{j}=\#\{|\zeta_{j,l}|<1,l=1,...,k_{j}\}$ , for $j=1,...,n$ ,

(ii) $F(A\times B)=\tilde{G}(A\times B)+\sum_{j=1}^{n}\delta_{s_{j}}(A)b_{j}(B)$ , where $\tilde{G}$ is a finite measure on $\textbf{S}\otimes\mathcal{B}(\mathbb{R})$ restricted on $S\times\mathbb{N}$ and with diffuse projections onto $S$ , and where $b_{j}$ satisfies (11), for $j=1,...,n$ .*

Conversely, for every such pair $(\nu_{0},F)$ , where $\zeta_{j,1},...,\zeta_{j,k_{j}}$ denote the $k_{j}$ complex roots of some polynomial $w\mapsto\sum_{l=0}^{k_{j}}a_{j,l}w^{l}$ for $j=1,...,n$ , there exists a unique random measure $\xi\in\mathcal{A}$ s.t. (9) holds.

Proof.

It follows from the same arguments as the one used in Theorem 4.2 and from Theorem 4.4. In particular, the first direction is trivial. For the other direction, we have the following. As mentioned in the proof of Theorem 4.2, we have a one-to-one correspondence for the atomless part of $\xi$ and its characteristic pair. Concerning the fixed component, let us assume that there exist $c_{j}$ and $b_{j}$ which are functions of some complex roots of some complex polynomial $w\mapsto\sum_{l=0}^{k_{j}}a_{j,l}w^{l}$ with no roots in the unite circle, where $k_{j}\in\mathbb{N}$ , $a_{0},...,a_{k_{j}-1}\geq 0,$ $a_{k_{j}}>0$ , and $a_{0}+\cdots+a_{k_{j}}=1$ . Then, by Theorem 4.4 there exists a QID probability distribution $\mathcal{L}(\beta_{j})=\sum_{l=0}^{k_{j}}a_{j,l}\delta_{l}$ . Since this holds for every $j=1,...,n$ then from the set of atoms $s_{1},...,s_{n}\in S$ we obtain the fixed component $\sum_{j=1}^{n}\delta_{s_{j}}\beta_{j}$ of a random measure in $\mathcal{A}^{\prime}$ . ∎

The same comment in Remark 4.3 for Theorem 4.2 holds here for Theorem 4.5. In addition, we refer to [18] for further properties of certain subclasses of point processes with quasi-Lévy measures.

5 A Nonparametric Bayesian example

In this section we show how the setting and the results presented in Sections 3 and 4 apply to a particular class of nonparametric prior distributions. The framework is the one of the paper by Broderick, Wilson and Jordan [2]. This framework is also explored in subsequent papers, see [3] among others. In their work they analyse Bayesian nonparametric prior and likelihood based on CRMs. In particular, they let the prior to be modelled as:

[TABLE]

where the cardinality $K$ may be either finite or infinity and where $(\theta_{k},\psi_{k})$ is a pair consisting of the frequency (or rate) of the $k$ -th trait together with its trait $\psi_{k}$ , which belongs to some space $\Psi$ of traits. Further, they let the data point for the $m$ -th individual to be modelled as:

[TABLE]

where $x_{m,k}$ represents the degree to which the $m$ -th data point belongs to the trait $\psi_{k}$ .

This setting can be applied to many real world applications. In particular, in topic modelling we have that $\psi_{k}$ represents a topic; that is, $\psi_{k}$ is a distribution over words in a vocabulary. Further, $\theta_{k}$ might represent the frequency with which the topic $\psi_{k}$ occurs in a corpus of documents. Finally, $x_{j,k}$ represents the number of words in topic $\psi_{j,k}$ that occur in the $j$ th document. So the $j$ th document has a total length of $\sum_{k=1}^{K}x_{j,k}$ words. In this case, the actual observation consists of the words in each $m$ documents, and the topics of the whole corpus of documents are latent.

From a mathematical (and formal) point of view $\Theta$ and $X_{m}$ are defined as CRMs. In particular, for the data $X_{m}$ , we let $x_{m,k}$ be drawn according to some distribution $H$ that takes $\theta_{k}$ as a parameter and have support on $\mathbb{Z}_{+}$ , that is $x_{m,k}\stackrel{{\scriptstyle indep}}{{\sim}}h(x_{m,k}|\theta_{k})$ , independently across $m$ and $k$ . We assume that $X_{1},...,X_{m}$ are i.i.d. conditional on $\Theta$ . Moreover, [2] consider the following assumptions for $\Theta$ and $X_{m}$ :

Assumption A00: the atomless component of $\Theta$ has characteristic pair $(\gamma,F)$ s.t. $\gamma=0$ and $F(d\theta\times d\psi)=\nu(d\theta)\cdot G(d\psi)$ , where $\nu$ is any $\sigma$ -finite measure on $\mathbb{R}_{+}$ and $G$ is a proper distribution on $\Psi$ with no atoms.

Assumptions A0, A1, and A2: $\Theta$ has a finite number of fixed atoms, $\nu(\mathbb{R}_{+})=\infty$ , and $\sum_{x=1}^{\infty}\int_{\mathbb{R}_{+}}h(x|\theta)\nu(d\theta)$ $<\infty$ , respectively.

We remark that by Assumption A00 we have that the location of the non-fixed atoms $\psi$ and the frequencies $\theta_{k}$ are stochastically independent. We call $\nu$ the weights rate measure of $\Theta$ . Moreover, the assumptions A0, A1 and A2 comes from a modelling need. By assuming A0 we are saying that we initially know certain traits, by A1 that there are a countable infinity of possible traits, and by A2 that the amount of information from finitely represented data is finite (because by A2 the number of non-fixed atoms is finite).

The first main result in [2] is Theorem 3.1, which shows explicit formulations for the posterior distribution $\Theta|X_{1}$ , and it is extended in Corollary 3.2 to the posterior $\Theta|X_{1:m}$ . In the following result we are going to show that similar results hold for any random measure in $\mathcal{A}$ without assuming A0, A1 or A2.

Notice that we can write $\Theta=\sum_{k=1}^{K}\theta_{k}\delta_{\psi_{k}}$ , where $K=K_{fix}+K_{ord}$ , namely $K$ is the sum of the fixed and non-fixed atoms, thus $K$ is random. Following the notation of [2], we denote the fixed component of $\Theta$ by $\Theta_{fix}=\sum_{k=1}^{K_{fix}}\theta_{fix,k}\delta_{\psi_{fix,k}}$ and the law of $\theta_{fix,k}$ by $F_{fix,k}:=\mathcal{L}(\Theta(\{\psi_{fix,k}\}))$ .

Proposition 5.1.

Let $\Theta\in\mathcal{A}$ satisfying $A00$ . Write $\Theta=\sum_{k=1}^{K}\theta_{k}\delta_{\psi_{k}}$ , and let $X_{1},...,X_{m}$ be generated conditional on $\Theta$ according to $X_{1}:=\sum_{k=1}^{K}x_{1,k}\delta_{\psi_{k}}$ with $x_{1,k}\stackrel{{\scriptstyle indep}}{{\sim}}h(x|\theta_{k})$ for proper, discrete probability mass function $h$ . It is enough to make the assumption for $X_{1}$ since the $X_{1},...,X_{m}$ are i.i.d. conditional on $\Theta$ .

Then let $\Theta_{post}$ be a random measure with the distribution of $\Theta|X_{1:m}$ (i.e. $\Theta|X_{1},...,X_{m}$ ). $\Theta_{post}$ is a CRM with three parts.

1.* For each $k\in[K_{fix}]$ , $\Theta_{post}$ has a fixed atom at $\psi_{fix,k}$ with weight $\theta_{post,fix,k}$ distributed according to the finite-dimensional posterior $F_{post,fix,k}(d\theta)$ that comes from prior $F_{fix,k}$ , likelihood $h$ , and observation $X({\psi_{fix,k}})$ . Moreover, $F_{fix,k}$ is QID with no Gaussian component and finite quasi-Lévy measure, and $F_{post,fix,k}(d\theta)\varpropto F_{fix,k}(d\theta)\prod_{j=1}^{m}h(x_{fix,j,k}|\theta)$ .*

2.* Let $\{\psi_{new,k}:k\in[K_{new}]\}$ be the union of atom locations across $X_{1},X_{2},...,X_{m}$ minus the fixed locations in the prior of $\Theta$ . $K_{new}$ is finite. Let $x_{new,j,k}$ be the weight of the atom in $X_{j}$ located at $\psi_{new,k}$ , for some $j=1,...,m$ . Then $\Theta_{post}$ has a fixed atom at $x_{new,k}$ with random weight $\theta_{post,new,k}$ , whose distribution $F_{post,new,k}(d\theta)\varpropto\nu(d\theta)\prod_{j=1}^{m}h(x_{new,j,k}|\theta)$ .*

3.* The ordinary component of $\Theta_{post}$ has finite weights rate measure $\nu_{post,m}(d\theta):=\nu(d\theta)h(0|\theta)^{m}$ .*

Remark 5.2.

Observe that since $\Theta\in\mathcal{A}$ then it has finite fixed atoms so assumption A0 is satisfied. Moreover, since $\nu$ is also finite and $h(x|\theta)\leq 1$ , then assumption A2 is also satisfied. The only difference with Theorem 3.1 and Corollary 3.2 in [2] is that we do not necessarily satisfy assumption A1. However, A1 is a modelling assumption rather than a technical one. Indeed, the proof of this result follows from similar arguments as the one used in the proof of Theorem 3.1 and Corollary 3.2 in [2]. We write them for completeness.

Proof.

Let us first prove the result for $\Theta|X$ . Any fixed atom $\theta_{fix,k}\delta_{\psi_{fix,k}}$ in the prior is independent of the other fixed atoms and of the ordinary component. Thus, all of $X$ except $x_{fix,k}:=X(\{\psi_{fix,k}\})$ is independent of $\theta_{fix,k}$ . Thus, $\Theta|X$ has a fixed atom at $\psi_{fix,k}$ and $\mathcal{L}(\theta_{post,fix,k})\varpropto F_{fix,k}(d\theta)h(x_{fix,k}|\theta)$ . Recall that since $G$ is continuous, all the fixed and non-fixed atoms of $\Theta$ are at a.s. distinct locations. Observe that by letting $\Psi_{fix}:=\{\psi_{fix,1},...,\psi_{fix,K_{fix}}\}$ we can define the fixed and ordinary component of $X$ by $X_{fix}(A):=X(A\cap\Psi_{fix})$ and $X_{ord}(A):=X(A\cap(\Psi\setminus\Psi_{fix}))$ , respectively.

Let $x\in\mathbb{Z}_{+}$ and let $\{\psi_{new,x,1},...,\psi_{new,x,K_{new,x}}\}$ be all the locations of atoms in $X_{ord}$ of size $x$ , which is finite and it is a subset of the locations of atoms of $\Theta_{ord}$ . Further, let $\theta_{new,x,k}:=\Theta(\{\psi_{new,x,k}\})$ . Observe that the values $\{\theta_{new,x,k}\}_{k=1}^{K_{new,x}}$ are generated from a thinned Poisson point process with rate measure (also known as intensity measure) $\nu_{x}(d\theta)=\nu(d\theta)h(x|\theta)$ , this is due to the $h(x|\theta)$ -thinning of the Poisson point process $\{\theta_{ord,k}\}_{k=1}^{K_{ord}}$ which has rate measure $\nu$ . Moreover, given that $\nu_{x}(\mathbb{R}_{+})<\infty$ , we have that $\mathcal{L}(\theta_{new,x,k})\varpropto\nu(d\theta)h(x|\theta)$ . Finally, observe that there is a possibility that atoms in $\Theta_{ord}$ are not observed in $X_{ord}$ , this happens when the likelihood draw returns a zero. These atom weights form a Poisson point process with rate measure $\nu(d\theta)h(0|\theta)$ .

Considering $\Theta|X_{1}$ as the new prior we obtain the formulation for the posterior $\Theta|X_{1},X_{2}$ by induction and by observing that the assumptions are still satisfied by $\Theta|X_{1}$ . Then, by induction we conclude the proof. ∎

In the next result, we show that random measures in $\mathcal{A}$ satisfying A00 are dense in the space of all CRMs satisfying A0, A1 and A2, namely all the random measures considered in [2] (and in [3]). Further, we show how this result translates into a convergence for the ordinary component of the posterior of these random measures.

Proposition 5.3.

Consider any random measure $\Theta$ satisfying A00, A0, A1 and A2, namely as in Theorem 3.1 in [2]. Then, there exists a sequence of random measures $(\Theta_{n})_{n\in\mathbb{N}}$ in $\mathcal{A}$ and satisfying A00 such that $\Theta_{n}\stackrel{{\scriptstyle d}}{{\to}}\Theta$ , as $n\to\infty$ . Further, $\Theta_{n,post,ord}\stackrel{{\scriptstyle d}}{{\to}}\Theta_{post,ord}$ , as $n\to\infty$ .

Proof.

The first part of this proof consists in realising that the arguments in the proof of Proposition 3.4 and Theorem 3.5 adapt to the present case.

Denote by $F(d\theta\times d\psi)=\nu(d\theta)\cdot G(d\psi)$ the Lévy measure of $\Theta$ . Following the proofs of Proposition 3.4 and Theorem 3.5 it is possible to see that the approximating sequence $\Theta_{n}$ should have Lévy measure $\nu_{n}(d\theta)\cdot G_{n}(d\psi)$ where $\nu_{n}(d\theta):=\nu((\frac{1}{n},\infty)\cap d\theta)$ and $G_{n}(d\theta):=G(S_{n}\cap d\psi)$ . However, given the assumptions on $F$ , namely that $G$ is a finite measure, we can (and we do) take the Lévy measure of $\Theta_{n}$ to be given by $F_{n}(d\theta\times d\psi):=\nu_{n}(d\theta)\cdot G(d\psi)$ . Then, applying the same arguments as the one used in the proof of Proposition 3.4 and Theorem 3.5, we obtain that the ordinary component of $\Theta_{n}$ converge in distribution to the one of $\Theta$ . The convergence of the fixed component follows directly from Theorem 3.5. Since $F_{n}$ is finite, we have that $\Theta_{n}$ is in $\mathcal{A}$ and that it satisfies A00.

For the convergence of the posteriors, consider $\Theta_{n}$ with its respective data points $X_{n,1},....,X_{n,m}$ , which are defined conditional on $\Theta_{n}$ as in Proposition 5.1 and belong to some probability spaces possibly different from the one of the other data points. From Proposition 5.1 we have that $\Theta_{n,post}$ has finite weights rate measure $\nu_{n,post,m}(d\theta):=\nu_{n}(d\theta)h(0|\theta)^{m}$ , while from Corollary 3.2 in [2] we know that $\Theta_{post}$ has finite weights rate measure $\nu_{post,m}(d\theta):=\nu(d\theta)h(0|\theta)^{m}$ . Since $\nu_{n,post,m}(\cdot)=\nu_{post,m}((\frac{1}{n},\infty)\cap\cdot)$ we obtain the result by Proposition 3.4. ∎

We summarise our findings so far in words. First, we obtain an explicit expression for the posterior of any random measure in $\mathcal{A}$ satisfying A00. Second, such random measures are dense with respect to convergence in distribution in the space of all priors considered in [2]. Third, when approximating in distribution such a prior, the ordinary component of the posteriors of these random measures converge to the one of the prior.

Thus, by these results we have a random truncation procedure; this is so because the number of non-fixed atoms of the prior is random and almost surely finite for every $n\in\mathbb{N}$ . Thus, the present truncation procedure extends the one of [3]. Indeed, we do not arbitrarily fix the number non-fixed atoms of the truncated prior and we are able to keep explicit formulations for the posterior of the truncated prior.

In the next result we show that, under certain conditions, we have automatic conjugacy for random measures in $\mathcal{A}^{\prime}$ satisfying A00.

Proposition 5.4.

Let $\Theta\in\mathcal{A}^{\prime}$ satisfying A00 and with weights rate measure having finite support. Let $X$ be generated conditional on $\Theta$ according to $X:=\sum_{k=1}^{K}x_{k}\delta_{\psi_{k}}$ with $x_{k}\stackrel{{\scriptstyle indep}}{{\sim}}h(x|\theta_{k})$ for proper, discrete probability mass function $h$ . Assume that the characteristic functions of the random variables of the fixed component of $\Theta_{post}$ have no zeros, namely assume that for every $x\in\mathbb{N}$ , $z\in\mathbb{R}$ and $k\in[K_{fix}]$

[TABLE]

Then, $\Theta_{post}\in\mathcal{A}^{\prime}$ , satisfies A00 and has weights rate measure with finite support.

Proof.

Assumption (12) implies that the characteristic functions of $F_{post,fix,k}$ and of $F_{post,new,j}$ have no zeros. Further, they are also supported on a finite subset of $\mathbb{Z}_{+}$ . Then, by Theorem 4.4 we obtain the result. ∎

Remark 5.5.

Let $\Theta$ and $X$ be as in Proposition 5.4. Notice that we can write $F_{fix,k}=\sum_{j=0}^{n^{(k)}}a^{(k)}_{j}\delta_{j}$ , where $a_{0},...,a_{n-1}\geq 0,$ $a_{n}>0$ , and $a_{0}+\cdots+a_{n}=1$ , for $k\in K_{fix}$ . Further, we can write $\nu=\sum_{j=1}^{K_{\nu}}b_{j}\delta_{j}$ , where $K_{\nu}\in\mathbb{N}$ indicates the highest value in $\textnormal{supp}(\nu)$ , $b_{1},...,b_{K_{\nu}-1}\geq 0$ and $b_{K_{\nu}}>0$ . Assumption (12) can be rewritten as: For every $x\in\mathbb{N}$ , $z\in\mathbb{R}$ and $k\in[K_{fix}]$ , assume that

[TABLE]

Moreover, by Theorem 4.4 this assumption (and so assumption (12)) is equivalent to the following assumption: For every $x\in\mathbb{N}$ and $k\in[K_{fix}]$ , assume that the polynomials $w\mapsto\sum_{j=0}^{n^{(k)}}h(x|j)a_{j}^{(k)}w^{j}$ and $w\mapsto\sum_{j=1}^{K_{\nu}}h(x|j)b_{j}w^{j}$ in the complex variable $w$ have no roots on the unit circle.

Remark 5.6.

The results presented in this section holds also if the weights rate measure is infinite, namely $\nu(\mathbb{R}_{+})=\infty$ (under the additional assumptions A1 and A2). In particular, the equivalent of Proposition 5.1 would be identical to Corollary 3.2 except for the result of point 1, because here we additionally know that $F_{fix,k}$ is QID with no Gaussian component and finite quasi-Lévy measure. Further, the equivalent of Proposition 5.3 would follows from the arguments presented taking into consideration Remark 3.7. The equivalent of Proposition 5.4 is more subtle and it is presented below.

Consider the following class of QID random measures:

[TABLE]

Let $\mathcal{A}^{\prime\prime}_{\infty}$ indicate the set of random measures like in $\mathcal{A}$ but with $\alpha$ being any atomless point process with independent increments. As a side comment, we remark that is possible to see that a result similar to Theorem 4.2 and Theorem 4.5 holds for the elements in $\mathcal{A}^{\prime\prime}$ , where thanks to Theorem 8.1 in [15] we are able to know the structure of their Lévy-Khintchine representation in more details.

Proposition 5.7.

Let $\Theta\in\mathcal{A}^{\prime\prime}_{\infty}$ and assume A00, A0, A1 and A2. Let $X$ be generated conditional on $\Theta$ according to $X:=\sum_{k=1}^{\infty}x_{k}\delta_{\psi_{k}}$ with $x_{k}\stackrel{{\scriptstyle indep}}{{\sim}}h(x|\theta_{k})$ for proper, discrete probability mass function $h$ . Assume that the characteristic functions of the random variables of the fixed component of $\Theta_{post}$ have no zeros, namely assume that for every $x\in\mathbb{N}$ , $z\in\mathbb{R}$ and $k\in[K_{fix}]$

[TABLE]

Then, $\Theta_{post}\in\mathcal{A}^{\prime\prime}_{\infty}$ and satisfies A00, A0, A1 and A2.

Proof.

Assumption (13) implies that the characteristic functions of $F_{post,fix,k}$ and of $F_{post,new,j}$ have no zeros. Further, they are also supported on $\mathbb{Z}_{+}$ . Then, by Theorem 8.1 in [15] we obtain the result. ∎

Observe that assumption (13) can be rewritten more explicitly as done in Remark 5.5 for assumption (12).

Acknowledgement

The author would like to thank Almut Veraart, Fabio Bernasconi and Ismael Castillo for useful discussions. The research developed in this paper is supported by the EPSRC (award ref. 1643696) at Imperial College London and by the Fondation Sciences Mathématiques de Paris (FSMP) fellowship, held at LPSM (Sorbonne University).

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Berger D. On Quasi-Infinitely Divisible Distributions with a Point Mass. In press in Mathematische Nachrichten , (2019+).
2[2] Broderick, T., Wilson, A.C. and Jordan, M.I. Posteriors, conjugacy, and exponential families for completely random measures. Bernoulli , 24, 3181-3221, (2018)
3[3] Campbell, T., Huggins, J.H., How, J., and Broderick, T. Truncated random measures. Bernoulli , 25, 1256-1288 (2019)
4[4] Chaiba H., Demni, N., Mouayn, Z. Analysis of generalized negative binomial distributions attached to hyperbolic Landau levels. Journal of Mathematical Physics 57, 072-103 (2016).
5[5] Cuppens, R. Decomposition of Multivariate Probabilities. Academic Press, New York, (1975).
6[6] Demni, N., Mouayn, Z. Analysis of generalized Poisson distributions associated with higher Landau levels. Infinite Dimensional Analysis, Quantum Probability and Related Topics , 18(04), (2015).
7[7] Ghosal, S., Van der Vaart, A. Fundamentals of Nonparametric Bayesian Inference. Cambridge University Press (2017).
8[8] Harris. Random measures and motions of point processes. Z. Wahrsch. verw. Geb. 18, 85-115, (1971).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

On quasi-infinitely divisible random measures

Riccardo Passeggeri111LPSM, Sorbonne University. Email: [email protected]

Abstract

Contents

1 Introduction

2 Notation and Preliminaries

Definition 2.1** (extended signed measure).**

Definition 2.2** (Signed bimeasure).**

Definition 2.3** (random measure).**

Definition 2.4** (completely random measure).**

Definition 2.5** (diffuse random measure).**

Remark 2.6**.**

Definition 2.7**.**

Definition 2.8** (quasi-Lévy type measure, quasi-Lévy measure, QID distribution, from [15]).**

Definition 2.9** (QID random measure).**

Theorem 2.10** (Theorem 4.3.4 in [5]).**

3 The density result for QID CRMs

Theorem 3.1** (see Theorem 4.11 in [12]).**

Theorem 3.2**.**

Proof.

Lemma 3.3**.**

Proof.

Proposition 3.4**.**

Proof.

Theorem 3.5**.**

Proof.

Remark 3.6**.**

Remark 3.7**.**

Theorem 3.8** (see Theorem 4.19 in [12]).**

Theorem 3.9**.**

Proof.

3.1 The density result for QID point processes

Proposition 3.10**.**

Proof.

Theorem 3.11**.**

Proof.

Proposition 3.12**.**

Proof.

4 Properties of the dense class A\mathcal{A}A

Proposition 4.1**.**

Proof.

Theorem 4.2**.**

Proof.

Remark 4.3**.**

4.1 Properties of the dense class A′\mathcal{A}^{\prime}A′

Theorem 4.4** (Theorem 3.9 in [15]).**

Theorem 4.5**.**

Proof.

5 A Nonparametric Bayesian example

Proposition 5.1**.**

Remark 5.2**.**

Proof.

Proposition 5.3**.**

Proof.

Proposition 5.4**.**

Proof.

Remark 5.5**.**

Remark 5.6**.**

Proposition 5.7**.**

Proof.

Acknowledgement

Definition 2.1 (extended signed measure).

Definition 2.2 (Signed bimeasure).

Definition 2.3 (random measure).

Definition 2.4 (completely random measure).

Definition 2.5 (diffuse random measure).

Remark 2.6.

Definition 2.7.

Definition 2.8 (quasi-Lévy type measure, quasi-Lévy measure, QID distribution, from [15]).

Definition 2.9 (QID random measure).

Theorem 2.10 (Theorem 4.3.4 in [5]).

Theorem 3.1 (see Theorem 4.11 in [12]).

Theorem 3.2.

Lemma 3.3.

Proposition 3.4.

Theorem 3.5.

Remark 3.6.

Remark 3.7.

Theorem 3.8 (see Theorem 4.19 in [12]).

Theorem 3.9.

Proposition 3.10.

Theorem 3.11.

Proposition 3.12.

4 Properties of the dense class $\mathcal{A}$

Proposition 4.1.

Theorem 4.2.

Remark 4.3.

4.1 Properties of the dense class $\mathcal{A}^{\prime}$

Theorem 4.4 (Theorem 3.9 in [15]).

Theorem 4.5.

Proposition 5.1.

Remark 5.2.

Proposition 5.3.

Proposition 5.4.

Remark 5.5.

Remark 5.6.

Proposition 5.7.