On non-local ergodic Jacobi semigroups: spectral theory, convergence-to-equilibrium and contractivity
Patrick Cheridito, Pierre Patie, Anna Srapionyan, Aditya, Vaidyanathan

TL;DR
This paper introduces non-local Jacobi operators as generators of ergodic Markov semigroups, analyzing their spectral properties, convergence rates, and contractivity, with explicit formulas and bounds, extending classical Jacobi polynomial theory.
Contribution
It develops a spectral and convergence theory for non-local Jacobi operators, including explicit semigroup expansions, spectrum characterization, and hypocoercivity results, generalizing classical Jacobi operator properties.
Findings
Semigroup admits a series expansion with generalized Jacobi polynomials
Spectrum of the non-self-adjoint generator is fully characterized
Variance decay is hypocoercive with explicit constants
Abstract
In this paper, we introduce and study non-local Jacobi operators, which generalize the classical (local) Jacobi operators. We show that these operators extend to generators of ergodic Markov semigroups with unique invariant probability measures and study their spectral and convergence properties. In particular, we derive a series expansion of the semigroup in terms of explicitly defined polynomials, which generalize the classical Jacobi orthogonal polynomials. In addition, we give a complete characterization of the spectrum of the non-self-adjoint generator and semigroup. We show that the variance decay of the semigroup is hypocoercive with explicit constants, which provides a natural generalization of the spectral gap estimate. After a random warm-up time, the semigroup also decays exponentially in entropy and is both hypercontractive and ultracontractive. Our proofs hinge on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On non-local ergodic Jacobi semigroups: spectral theory, convergence-to-equilibrium and contractivity
P. Cheridito
Department of Mathematics, ETH Zurich, Rämistrasse 101, 8092 Zurich, Switzerland.
,
P. Patie
School of Operations Research and Information Engineering, Cornell University, Ithaca, NY 14853.
,
A. Srapionyan
Center for Applied Mathematics, Cornell University, Ithaca, NY 14853.
and
A. Vaidyanathan
Center for Applied Mathematics, Cornell University, Ithaca, NY 14853.
Abstract.
In this paper, we introduce and study non-local Jacobi operators, which generalize the classical (local) Jacobi operators. We show that these operators extend to generators of ergodic Markov semigroups with unique invariant probability measures and study their spectral and convergence properties. In particular, we derive a series expansion of the semigroup in terms of explicitly defined polynomials, which generalize the classical Jacobi orthogonal polynomials. In addition, we give a complete characterization of the spectrum of the non-self-adjoint generator and semigroup. We show that the variance decay of the semigroup is hypocoercive with explicit constants, which provides a natural generalization of the spectral gap estimate. After a random warm-up time, the semigroup also decays exponentially in entropy and is both hypercontractive and ultracontractive. Our proofs hinge on the development of commutation identities, known as intertwining relations, between local and non-local Jacobi operators and semigroups, with the local objects serving as reference points for transferring properties from the local to the non-local case.
Key words and phrases:
Markov semigroups, spectral theory, non-self-adjoint operators, convergence to equilibrium, hypercontractivity, ultracontractivity, heat kernel estimates
2010 Mathematics Subject Classification:
37A30, 47D06, 47G20, 60J75
The authors are grateful to Paul Jenkins and Soumik Pal for fruitful discussions, and they would like to thank the anonymous referees for valuable comments.
1. Introduction
In this paper we study the non-local Jacobi operators, given for suitable functions , by
[TABLE]
where is the classical Jacobi operator
[TABLE]
and satisfy Assumption A below, and for two functions and with domains , we denote by the product convolution given by
[TABLE]
The classical Jacobi operator is a central object in the study of Markovian diffusions. For instance, it is a model candidate for testing functional inequalities such as the Sobolev and log-Sobolev inequalities; see for instance, Bakry [4] and Fontenas [26]. If , an integer, there exists a homeomorphism between this particular Jacobi operator and the radial part of the Laplace–Beltrami operator on the -sphere, revealing connections to diffusions on higher-dimensional manifolds that, in particular, lead to a curvature-dimension inequality as described in Bakry et al. [5, Chapter 2.7]. From the spectral theory viewpoint, the Markov semigroup is diagonalizable with respect to an orthonormal, polynomial basis of , where denotes its unique invariant probability measure. As a consequence, it can be shown that the semigroup converges to equilibrium in different ways, such as in variance or entropy, and is both hypercontractive as well as ultracontractive; see Appendix A, where we review essential facts about the classical Jacobi operator, semigroup and process. Classical Jacobi processes have been used in applications such as population genetics under the name Wright–Fisher diffusion, see e.g., Ethier and Kurtz [25, Chapter 10], Demni and Zani [20], Griffiths et al. [29, 28], Huillet [31] or Pal [38], as well as in finance; see e.g., Delbaen and Shirikawa [19] or Gourieroux and Jasiak [27].
Due to the non-local part of and its non-self-adjointness as a densely defined and closed operator in with denoting the invariant measure of the corresponding semigroup, a fact that is proved below, the traditional techniques that are used to study seem out of reach. Nevertheless, our investigation of yields generalizations of the classical results mentioned above. An important ingredient of our approach is the notion of an intertwining relation, which is a type of commutation relationship for linear operators. For fixed and parameters , to be specified below, we develop identities of the form
[TABLE]
on the space of polynomials as well as
[TABLE]
on and , respectively, where and are bounded linear operators. While (1.3) allows us to show that generates an ergodic Markov semigroup with unique invariant probability measure , we use (1.4) to obtain the spectral theory, convergence-to-equilibrium, hypercontractivity, and ultracontractivity estimates for .
The rest of the paper is organized as follows. The main results are stated in Section 2. All proofs are given in Section 3, and a specific family of non-local Jacobi semigroups is considered in Section 4. Known results on classical Jacobi operators, semigroups and processes are collected in Appendix A.
2. Main results on non-local Jacobi operators and semigroups
In this section we state our main results concerning the non-local operator defined in (1.1). Throughout the paper we make the following
Assumption A**.**
The function is assumed to be [math] outside of and to satisfy . Moreover, is a finite non-negative Radon measure on , and if ,
[TABLE]
while otherwise, .
2.1. Preliminaries and existence of Markov semigroup
Anticipating Theorem 2.1 below, we already mention that the càdlàg realization of the Markov semigroup has downward jumps from to , , which occur at a frequency given by the Lévy kernel ; see Lemma 3.1. Also note that for , we have and therefore, . Next, we consider the convex twice differentiable and eventually increasing function given by
[TABLE]
which is easily seen to always have 0 as a root, and has a root if and only if . Set
[TABLE]
and define by
[TABLE]
For instance, if , then
[TABLE]
and we note that (resp. ) is uniquely determined by , and (resp. and ), so that for fixed , there is a one-to-one correspondence between and . As we show in Lemma 3.2 below, is a Bernstein function; that is, is continuous, infinitely differentiable on , and for all and ; see Bertoin [8] or Schilling et al. [49] for a thorough exposition of Bernstein functions and subordinators. As a Bernstein function, admits an analytic extension to the right half-plane ; see e.g., Patie and Savov [41, Chapter 4]. We write for the unique solution, in the space of positive definite functions, to the functional equation
[TABLE]
with ; see Patie and Savov [40] for a thorough account on this set of functions that generalize the gamma function, which appears as a special case if . In particular, for any ,
[TABLE]
with the convention .
Let be the Banach space of continuous functions equipped with the sup-norm , and denote by the subspace of times continuously differentiable functions with . We call a one-parameter semigroup of linear operators on a Markov semigroup if for all and , , if , and . A probability measure on is invariant for if for all and ,
[TABLE]
where the last equality serves as the definition of . It is then classical, see either Bakry et al. [5] or Da Prato [18], that a Markov semigroup on with an invariant probability measure can be extended to the weighted Hilbert space
[TABLE]
Such a semigroup is said to be ergodic if, for every , in the -norm.
Next, for complex numbers such that , we denote by the Pochhammer symbol given by
[TABLE]
Writing for the algebra of polynomials on and denoting , , we formally define
[TABLE]
and note that in Lemma 3.2 we show that . Recall that a sequence is said to be Stieltjes moment determinate if it is the moment sequence of a unique probability measure on . Our first main result provides the existence of an ergodic Markov semigroup generated by the non-local Jacobi operator .
Theorem 2.1**.**
**
- (i)
* is a Stieltjes moment determinate sequence, and the corresponding probability measure is an absolutely continuous measure with support and has a continuous density that is positive on .* 2. (ii)
The extension of to an operator on , still denoted by , is the infinitesmal generator, having as a core, of an ergodic Markov semigroup on whose unique invariant measure is .
The proof of (ii) makes use of an intertwining relation stated in Proposition 3.6, which is an original approach to showing that the assumptions of the Hille–Yosida–Ray Theorem are fulfilled; see Lemma 3.8 for more details. More generally, the idea of constructing a new Markov semigroup by intertwining with a known, reference Markov semigroup goes back to Dynkin [23], whose ideas were extended by Rogers and Pitman in [46], leading to the characterization of Markov functions; that is, measurable maps that preserve the Markov property. More recently, Borodin and Olshanski [10] also used intertwining relations combined with a limiting argument to construct a Markov process on the Thoma cone.
We also point out that the invariant measure is a natural extension of the beta distribution, which is recovered if , as in this case we have in (2.6). The requirement in Assumption A that be a finite measure is necessary for the existence of an invariant probability measure for . Indeed, as we illustrate in our proof of Theorem 2.1, any candidate for such a measure must have moments given by (2.6). If is not a finite measure, then estimates by Patie and Savov in [40, Theorem 3.3] imply that the analytical extension of (2.6) to is not bounded along imaginary lines, a necessary condition for to be a probability measure.
2.2. Spectral theory of the Markov semigroup and generator
We proceed by developing the -spectral theory for both, the semigroup and the operator . Recalling that, for fixed , there is a one-to-one correspondence between and the Bernstein function given in (2.3), we define, for , the polynomial as
[TABLE]
where is given by
[TABLE]
Note that for , we get in (2.1), and the polynomials boil down, up to a normalizing constant if , to the classical Jacobi polynomials reviewed in Appendix A. Let us denote by the Rodrigues operator
[TABLE]
and set
[TABLE]
We write for the density given in Theorem 2.1.(i), and define, for every , the function as
[TABLE]
In particular, the function
[TABLE]
will be useful for us in the sequel. Let us denote by the usual Lebesgue space of square-integrable functions on .
Proposition 2.2**.**
Set , and define, for , as
[TABLE]
where is the product convolution operator defined in (1.2). Then, . Moreover, if , then , and if , then .
Remark 2.3**.**
The definition in (2.10) makes sense regardless of the differentiability of , since and . However, the differentiability of is limited by the smoothness of , which is quantified by the index . Note that for , one has and, by moment identification and determinacy, it is easily checked that (2.10) boils down, up to a multiplicative constant, to the Rodrigues representation of the classical Jacobi polynomials given in (A.6). In this sense, and generalize in different ways, related to different representations of these orthogonal polynomials.
We call two sequences biorthogonal if for and otherwise, and then write for the projection operator given by . Moreover, a sequence admitting a biorthogonal sequence will be called minimal and a sequence that is both minimal and complete, in the sense that its linear span is dense in , will be called exact. It is easy to show that a sequence is minimal if and only if none of its elements can be approximated by linear combinations of the others. If this is the case, then a biorthogonal sequence is uniquely determined if and only if is complete. Next, a sequence is said to be a Bessel sequence if there exists a constant such that, for all ,
[TABLE]
The quantity is a Bessel bound of , and the smallest such is called the optimal Bessel bound of ; see e.g., Christensen [16] for further information on these objects that play a central role in non-harmonic analysis.
We write for the spectrum of the operator in and for its point spectrum, and similarly define and . For an isolated eigenvalue we write and for the algebraic and geometric multiplicity of , respectively. We also define, for ,
[TABLE]
noting that , which explains our choice of notation, and recall that ; see Appendix A. Writing for the -adjoint of , we have the following spectral theorem for .
Theorem 2.4**.**
Let .
- (i)
Then,
[TABLE]
where the series converges in operator norm and is an exact Bessel sequence with optimal Bessel bound 1 and unique biorthogonal sequence , which is also exact. Moreover, for all , () is an eigenfunction of () with eigenvalue . 2. (ii)
The operator is compact; that is, the semigroup is immediately compact. 3. (iii)
The following spectral mapping theorem holds:
[TABLE]
Furthermore, and, for any ,
[TABLE] 4. (iv)
The operator is self-adjoint in if and only if .
The expansion in Theorem 2.4.(i) is not valid for since is a Bessel sequence but not a Riesz sequence, as it is not the image of an orthogonal sequence under a bounded linear operator having a bounded inverse; see Proposition 3.20 below. The sequence of non-self-adjoint projections is not uniformly bounded in , see Remark 3.19, and, in contrast to the self-adjoint case, the eigenfunctions of and do not form a Riesz basis of . Finally, it follows from Theorem 2.4.(iv) that for all .
2.3. Convergence-to-equilibrium and contractivity properties
We call a function , defined on an interval , admissible if
[TABLE]
Given an admissible function we write, for any with ,
[TABLE]
for the so-called -entropy of . An important special case is with , so that (2.13) gives the variance of . Recall that in the classical case , we have the following equivalence between the Poincaré inequality for and the spectral gap inequality for ,
[TABLE]
where the infimum is over all functions in the -domain of such that ; see e.g., Bakry et al. [5, Chapter 4.2]. The above variance decay is optimal in the sense that the decay rate does not hold for any constant strictly greater than . Another important instance of (2.13) corresponds to and . It recovers the classical notion of entropy for a non-negative function, written simply as . Here the classical equivalence is between the log-Sobolev inequality and entropy decay,
[TABLE]
where the infimum is over all functions in the -domain of such that . Note that the optimal entropy decay rate is obtained only for , in which case, , while otherwise ; see, e.g., Fontenas [26]. We review these notions for the classical Jacobi semigroup in Appendix A. For more details, we refer to Chafaï [14], Ané et al. [2] and the relevant sections of Bakry et al. [5]. However, due to the non-self-adjointness and non-local properties of , it seems challenging to develop an approach based on the Poincaré or log-Sobolev inequalities. For this reason, we take an alternative route to tackling convergence to equilibrium by using the concept of interweaving relations recently introduced by Patie and Miclo in [34, Section 3.5] and [35].
Now, consider the function given by
[TABLE]
and recall that for , we have . Note that is a Bernstein function, as it is obtained by translating and centering the well-known Bernstein function . In the literature is known as the Laplace exponent of the so-called relativistic -stable subordinator; see Bakry [3] or Bogdan et al. [9]. Recalling that any Bernstein function is analytic on the right half-plane, and writing , we denote
[TABLE]
Then , and if , one has . So, in any case, . For , we write
[TABLE]
noting that if , then . Next, for any and , we denote by a non-negative random variable, whose existence is provided in the theorem below, with Laplace transform
[TABLE]
and write .
Theorem 2.5**.**
Let . Then, for all and , the following hold:
- (i)
For any ,
[TABLE]
and . 2. (ii)
The function , defined in (2.16), is a Bernstein function. Therefore, is infinitely divisible and there exists a subordinator with . For any with , one has
[TABLE]
and if , then
[TABLE]
Furthermore, if and , , is an admissible function as in (2.12), then, for any such that and ,
[TABLE]
Remark 2.6**.**
Since , the estimate in Theorem 2.5.(i) gives the hypocoercivity, in the sense of Villani [51], for non-local Jacobi semigroups. This notion continues to attract research interest, especially in the area of kinetic Fokker–Planck equations; see, e.g. Baudoin [6], Dolbeault et al. [21] or Mischler and Mouhout [36]. We are able to identify the hypocoercive constants, namely the exponential decay rate as twice the spectral gap and the constant in front of the exponential, which is a measure of the deviation of the spectral projections from forming an orthogonal basis and is 1 in the case of an orthogonal basis. Note that in general, the hypocoercive constants may be difficult to identify and may have little to do with the spectrum. Similar results have been obtained by Patie and Savov [41] as well as Achleitner et al. [1]. Our hypocoercive estimate is obtained via intertwining, which suggests that hypocoercivity may be studied purely from this viewpoint, an idea that is further investigated in the recent work [43] by the second and fourth author.
Remark 2.7**.**
The second part of Theorem 2.5 gives the exponential decay of in entropy, but after an independent random warm-up time. Note that, for the entropy decay rate is the same as for while under the mild assumption , we get the optimal rate irrespective of the precise value of . The proof relies on developing so-called interweaving relations, a concept which has been introduced and studied in the recent work [35] by Miclo and Patie, where the classical Jacobi semigroup serves as a reference object; see Proposition 3.21 below.
Remark 2.8**.**
The additional condition for the -entropic convergence in Theorem 2.5.(ii) ensures that we can invoke the known result (A.13) for the classical Jacobi semigroup . However, our approach allows us to immediately transfer any improvement of (A.13) to the non-local Jacobi semigroup .
Next, we recall the famous equivalence between entropy decay and hypercontractivity due to Gross [30]. For any and such that , one has
[TABLE]
where we use the shorthand for . To state our next result we write, if , for the Sobolev constant of of order , and recall that as a result of the Sobolev inequality for , one gets for , which implies that is ultracontractive, that is, for all ; see Appendix A for more details. We have the following concerning the contractivity of .
Theorem 2.9**.**
For any and , the following hold:
- (i)
For , we have the hypercontractivity estimate
[TABLE]
and furthermore, if , then
[TABLE] 2. (ii)
If, in addition, , then for , we have the ultracontractivity estimate
[TABLE]
where for , one can choose , yielding .
2.4. Bochner subordination of the semigroup
We write for the semigroup subordinated, in the sense of Bochner, with respect to the subordinator whose existence is guaranteed by Theorem 2.5.(ii), that is,
[TABLE]
so that . Note that is also an ergodic Markov semigroup on with as an invariant measure, and its generator is given by ; see Sato [48, Chapter 6]. For the subordinated semigroup we have the following.
Theorem 2.10**.**
For any and the statement of Theorem 2.4 holds for and if is replaced with
[TABLE]
and the assertions of Theorem 2.5.(ii) and Theorem 2.9.(i) hold for if is replaced with and with 1. Moreover, for all and such that , for any and , where the heat kernel satisfies the estimate
[TABLE]
for Lebesgue-almost all . As above, if , one can choose , yielding .
We point out that the Markov process realization of (resp. ) has "only negative (resp. non-symmetric two-sided) jumps and can easily be shown to be a polynomial process on in the sense of Cuchiero et al. [17]. Markov semigroups obtained by subordinating with respect to any conservative subordinator with Laplace exponent (growing fast enough at infinity, e.g. logarithmically) are also in this class, and we obtain the spectral expansion of the subordinated semigroup from Theorem 2.4 by replacing with . Note that in the aforementioned paper the authors investigate the martingale problem for general polynomial operators on the unit simplex, of which and are specific instances. In particular, is a Lévy type operator with affine jumps of type 2 in the sense of [17]. For such operators, existence and uniqueness for the martingale problem have been shown in [17] under the weaker condition . However, Assumption A allows us to obtain the existence and uniqueness of an invariant probability measure.
3. Proofs
3.1. Preliminaries
We start by proving some preliminary results that will be useful throughout the paper. We first give an alternative form of the operator , which will make some later proofs more transparent. Recall that , given by , is a finite non-negative Radon measure on .
Lemma 3.1**.**
One has , and the operator defined in (1.1) may, for suitable functions , be written as
[TABLE]
Proof.
Since and
[TABLE]
one obtains that . Consequently, one has for all ,
[TABLE]
Thus, by Tonelli’s theorem and a change of variables,
[TABLE]
which yields
[TABLE]
Integration by parts and a change of variables give
[TABLE]
and the lemma follows. ∎
In the sequel we keep the notation , and , . Let be the function given by
[TABLE]
The following result collects some useful properties of the functions and given in (2.3) and (3.2), respectively.
Lemma 3.2**.**
**
- (i)
* is a Bernstein function satisfying .* 2. (ii)
, given in (2.2), satisfies with if and only if . Additionally, if , then , while if , then . 3. (iii)
Suppose . Then is a Bernstein function that is in correspondence with the non-local Jacobi operator with parameters , and the non-negative function , where is the finite non-negative Radon measure given by
[TABLE]
Furthermore, with and .
Proof.
First we rewrite (2.1) using integration by parts to get, for any ,
[TABLE]
Since, by Lemma 3.1, we have , we recognize as the Laplace exponent of a spectrally negative Lévy process with a finite mean given by . In particular, is a convex, eventually increasing, twice differentiable function on that is zero at [math]. Therefore, it has a strictly positive root if and only if . By the Wiener–Hopf factorization of Lévy processes, see e.g., [33, Chapter 6.4], we get for (resp. ) that (resp. ) for a Bernstein function . That then follows from the well-known result that , which can be obtained by dominated convergence since is a finite measure. This completes the proof of (i).
Next, we show , which, by the convexity of is equivalent to . Indeed, from (3.3) and an application of Tonelli’s theorem, we get
[TABLE]
where we used the assumption . If , then , and we obtain from (3.3),
[TABLE]
which gives . On the other hand, if , then the fact that forces , which completes the proof of (ii).
To show (iii), we write . According to [15, Proposition 2.2], is the Laplace exponent of a spectrally negative Lévy process with Gaussian coefficient 1, mean and Lévy measure . Observe that and
[TABLE]
So the Wiener–Hopf factorization of shows that is a Bernstein function. Moreover, integration by parts of gives
[TABLE]
where the boundary terms are easily seen to evaluate to 0. Finally, using the assumption , we get that , while the condition follows from the assumption that . ∎
3.2. Proof of Theorem 2.1.(i)
Before we begin we provide an analytical result, which will allow us to show that the support of is and will also be used in subsequent proofs. We say that a linear operator is a Markov multiplicative kernel if for some random variable . With as in (2.14), we denote, for any ,
[TABLE]
recalling that for , we have , so that at least . Note that if , explaining the notation. By [41, Lemma 10.3], the mapping
[TABLE]
is a Bernstein function, and, by Proposition 4.4(1) in the same paper, we also have that, for any ,
[TABLE]
is a Bernstein function. We define the following linear operators acting on the space of polynomials , recalling that for , , .
[TABLE]
where is defined for any and was defined in (3.2). We write for the unital Banach algebra of bounded linear operators on and say that a linear operator between two Banach spaces is a quasi-affinity if it has trivial kernel and dense range.
Lemma 3.3**.**
The operators , and defined in (3.7) are Markov multiplicative kernels associated to random variables , and , respectively, valued in , and hence moment determinate. All 3 random variables have continuous densities, and all 3 operators belong to . Furthermore, is a quasi-affinity on while and have dense range in .
Proof.
The claims regarding the operators and and corresponding random variables have been proved in [41]; see Theorem 5.2, Proposition 6.7(1) and Section 7.1 therein. Let be the function characterized by its Laplace transform via
[TABLE]
and note that is increasing. Moreover, since has a Gaussian component, is at least continuously differentiable; see e.g., [33, Section 8.2]. The law of the random variable is given by
[TABLE]
So it clearly is supported on and has a continuous density. The claims concerning were shown in [42, Lemma 4.2], where we note that since has a Gaussian component. ∎
Now, suppose , so that, by Lemma 3.2, . Then, for all , (2.6) reduces to
[TABLE]
Since , we get that given in (3.6) is a Bernstein function. Indeed, if , we must have , and the function is Bernstein since , see e.g. [49, Chapter 16], while if , Proposition 4.4(1) of [49] guarantees that is a Bernstein function. One straightforwardly checks that
[TABLE]
for all , and it follows from [7] that is indeed a Stieltjes moment determinate sequence of a probability measure . Its absolute continuity follows from [39, Proposition 2.4].
Now suppose , so that , and observe that (2.6) factorizes as
[TABLE]
where, by the above arguments, the first term in the product is a Stieltjes moment sequence, whereas the second term is the moment sequence of a beta distribution (see (A.3)). Consequently, in this case, one also has that is a Stieltjes moment sequence, and we temporarily postpone the proof of its moment determinacy and absolute continuity to after the proof of Lemma 3.4. We write for the sequence obtained from (2.6) by replacing with defined in (3.2) and with the same .
Lemma 3.4**.**
For all and , we have the following factorization of operators on the space ,
[TABLE]
where the second identity holds if and the third if .
Remark 3.5**.**
Once we have established the moment determinacy of for , the operator factorizations in Lemma 3.4 extend to the space of bounded measurable functions. Indeed, (3.8) implies
[TABLE]
where the second identity holds if and the third if ; , , and are random variables with laws , , and , respectively, and denotes the product of independent random variables.
Proof.
By (3.7), we have for all ,
[TABLE]
By considering the cases and separately, we obtain the desired right-hand side, noting that is well-defined since , due to and [41, Proposition 4.4(1)].
Next, we note that by Lemma 3.2.(iii), if and only if . So, for all ,
[TABLE]
which, by linearity, shows the second identity.
Finally, we know from Lemma 3.2.(iii) that . Hence, [math] is the only non-negative root of , and therefore,
[TABLE]
Straightforward computations give that, for any ,
[TABLE]
Putting these observations together yields
[TABLE]
where we used the recurrence relations for both, the gamma function and ; see e.g. (2.4). This completes the proof. ∎
Now suppose that for , the measure is moment indeterminate. Then, as the sequence is a non-vanishing Stieltjes moment sequence, it follows from (3.8) by invoking [7, Lemma 2.2] that also the beta distribution is moment indeterminate, which is a contradiction. Therefore, we conclude that in all cases, is moment determinate, and consequently we have the extended operator factorizations of Remark 3.5.
To obtain the absolute continuity of in the case , we note that the factorization implies, by moment determinacy, that is the product convolution of two absolutely continuous measures, and hence, again absolutely continuous.
If we take , then and . In this case, we denote . For , we obtain from Remark 3.5 that
[TABLE]
Since, by Lemma 3.3, the law of is supported in and has a continuous density, we obtain from the second identity in (3.9) that the support of is for a constant , and has a continuous density that is positive on . But since the law is also supported in , we deduce from the first identity in (3.9) that .
The case follows from analogous arguments with and replaced by and , respectively, where we note that, since , the support of is and has a continuous density that is positive on . This completes the proof of Theorem 2.1.(i). ∎
3.3. Proof of Theorem 2.1.(ii)
We start by proving the following more general intertwining that will be useful in subsequent proofs, recalling the definition of in (3.7).
Proposition 3.6**.**
With and as in (2.15) and (3.4), respectively, we have, for any ,
[TABLE]
Remark 3.7**.**
Note that is the common parameter of the Jacobi type operators in (3.10) while the constant part of the affine drift as well as the non-local components are different. The commonality of is what ensures the isospectrality of these operators, as their spectrum depends only on ; see Theorem 2.4.(ii) and (A.7).
We split the proof of Proposition 3.6 into two lemmas. Among other things, our proof hinges on the interesting observation that intertwining relations are stable under perturbation with an operator that commutes with the intertwining operator, see Lemma 3.9 below. Let be the operator defined as
[TABLE]
write , where satisfies Assumption A, and set .
Lemma 3.8**.**
With the notation of Proposition 3.6, one has
[TABLE]
Proof.
Using , one obtains for any ,
[TABLE]
Combining this with (3.7) gives
[TABLE]
while on the other hand,
[TABLE]
where the second equality follows by considering the cases and separately. Now, the lemma follows from the linearity of the involved operators. ∎
For a Borel measure on , define , and denote by the operator given by
Lemma 3.9**.**
Let be a Borel measure on satisfying for all . Then
[TABLE]
for all and .
Proof.
It follows from the assumptions that
[TABLE]
∎
Proof of Proposition 3.6.
It is now an easy exercise to complete the proof of Proposition 3.6. Let us write
[TABLE]
Then, for any , we get from Lemmas 3.8 and 3.9 and the linearity of the involved operators,
[TABLE]
∎
Having established the necessary intertwining relation, we are now able to show that extends to the generator of a Markov semigroup.
Lemma 3.10**.**
The operator is closable in , and its closure is the infinitesimal generator of a Markov semigroup on .
Proof.
We want to invoke the Hille–Yosida–Ray theorem for Markov generators, see [12, Theorem 1.30], which requires that and, for some , are dense in and in addition, satisfies the positive maximum principle on .
The density of in follows from the Stone–Weierstrass theorem. To show that is dense in for some , we set . Then , in which case, we write . By Lemma 3.3, is injective and bounded on . So, the inverse is a linear operator on . But since is a Markov multiplicative kernel, we get by injectivity that and . Putting these observations together, we deduce from the first intertwining in Proposition 3.6 that
[TABLE]
Hence, for any ,
[TABLE]
where we used the trivial commutation of with . For , Assumption A guarantees that since and . Therefore, belongs to the domain of , which is explicitly described in (A.1), and as is an invariant subspace of the classical Jacobi semigroup , we get that is a core of , see [12, Lemma 1.34]. Hence, we obtain from the reverse direction of the Hille–Yosida–Ray theorem that is dense in for any . Since the image of a dense subset under a bounded operator with dense range is also dense in the codomain, is dense in for any .
Now, consider and such that . If , one has and therefore,
[TABLE]
If , we use Lemma 3.1 to write
[TABLE]
and we observe that
[TABLE]
If , we must have and , from which one obtains . On the other hand, if , then and so, since . This shows that satisfies the positive maximum principle on , and it follows that extends to the generator of a Feller semigroup in the sense of [12, Theorem 1.30]. The fact that is conservative, i.e. , follows from
[TABLE]
which is a consequence of ; see e.g. [12, Lemma 1.26]. ∎
Proof of Theorem 2.1.(ii).
To complete the proof it suffices to establish the claims concerning the invariant measure. For we have,
[TABLE]
where we have used Proposition 3.6 with , Lemma 3.4 and the fact that is the invariant measure of . This shows that on the dense subset of , which implies that is an invariant measure of ; see for instance [5, Section 1.4.1]. To show uniqueness, we note that any other invariant measure of must have all positive moments finite and satisfy
[TABLE]
for any , where we again used that . By uniqueness of the invariant measure of , we obtain the factorization on , and the moment determinacy of then forces . Finally the extension of to a Markov semigroup on is classical, see for instance the remarks before the theorem, and it is well-known that if has a unique invariant measure, it is an ergodic Markov semigroup; see e.g. [18, Theorem 5.16]. ∎
3.4. Proof of Proposition 2.2
To prove Proposition 2.2, we first show two auxiliary results, the first of which provides a characterization of the functions appearing in (2.10). We recall that the Mellin transform of a finite measure , resp. an integrable function , on is given by
[TABLE]
which is valid for at least . We denote by (resp. ), with reals, the linear space of functions such that there exist for which, for all ,
[TABLE]
(resp. the linear space of continuous linear functionals on endowed with a structure of a countably multinormed space as described in [37, p. 231]). For any and , we denote
[TABLE]
where denotes the Rodrigues operator defined in (2.8) and the last identity follows from (A.6). For any complex number we recall that the Pochhammer notation to any such that is given in (2.5), and, for the remainder of the proofs, we shall write for the -inner product, adopting the same notation for other weighted Hilbert spaces.
We know from Lemma 3.3 that has a continuous density on . The corresponding Markov multiplicative kernel is given by . We write and denote by the operator given by .
Proposition 3.11**.**
For any , the Mellin convolution equation
[TABLE]
has a unique solution, in the sense of distributions, given by
[TABLE]
Its Mellin transform is given, for any with , by
[TABLE]
Proof.
The proof is an adaptation of the proof of [41, Lemma 8.5] to the current setting. Since the mapping is analytic on and , for any , see for instance [41, Proposition 6.8], we deduce from [37, Theorem 11.10.1] that , for every and for every . So, for and with , , we have
[TABLE]
On the other hand, for any , we get, from [37, 11.7.7] and a simple computation,
[TABLE]
So we deduce that the Mellin transform of a solution to (3.14) takes the form
[TABLE]
Since for , is analytic with , we deduce from [37, Theorem 11.10.1] that , for any . Hence, by means of [37, 11.7.7], we have that with is a solution to (3.14), and the uniqueness of the solution follows from the uniqueness of Mellin transforms in the distributional sense. ∎
Lemma 3.12**.**
For and , we have the estimate
[TABLE]
which holds uniformly on bounded -intervals and for large enough, where is a constant depending on and the considered -interval, and is given in (2.9).
Proof.
By uniqueness of in the space of positive-definite functions, the Mellin transform of is given by
[TABLE]
where , with . Invoking [40, Equation (6.20)], we get the following estimate, which holds uniformly on bounded -intervals and for large enough,
[TABLE]
with a constant depending on , and where, for any , with denoting the Lévy measure of . We know form Lemma 3.2.(ii) that if and otherwise. Moreover, if , we obtain from (2.3) that . Thus to utilize the estimate in (3.17) we need to identify in the case . To do that, let us write , where . From the fact that , we conclude that is itself a function of the form (3.3), which gives , where is the Lévy measure of and the Lévy measure of obtained via (3.3). As is a Bernstein function it is given, for , by
[TABLE]
for some . Thus, for ,
[TABLE]
The third equality follows from Tonelli’s theorem, justified as all integrands therein are non-negative, and using . Thus we deduce
[TABLE]
where the latter follows by some straightforward integration by parts and shows that is indeed the Lévy measure of . Next, an application of [41, Proposition 4.1(9)] together with another integration by parts yields . Putting the different pieces together, we get , so that in all cases . Therefore, we can deduce from (3.17) that
[TABLE]
which, as before, holds uniformly on bounded -intervals and for large enough. Next, we recall the following classical estimate for the gamma function
[TABLE]
where is a constant continuously depending on . Combining this estimate with the one in (3.18) we thus get, uniformly on bounded -intervals and for large enough,
[TABLE]
for a constant . Since is a function of and the constants in the estimate for the gamma function, it follows that it only depends on and -interval on which the estimate holds. Finally, the fact that follows from Lemma 3.2.(ii). ∎
Proof of Proposition 2.2.
Note that and trivially, . Then, well-known properties of convolution give , and that is a well-defined -function. To show that implies , we note that the classical estimate for the gamma function given in (3.19) yields that, for with fixed,
[TABLE]
where is a positive constant depending only on , , and . Thus, we get from (3.15) that has the same rate of decay along imaginary lines as . So Lemma 3.12 together with Parseval’s identity for Mellin transforms shows that . Finally, since , the differentiability of is determined by the differentiability of . Invoking Lemma 3.12, we get for and large enough that
[TABLE]
uniformly on bounded -intervals, where is a constant. A classical Mellin inversion argument then gives if . ∎
3.5. Proof of Theorem 2.4
To prove this result we need to develop further intertwinings for and lift these to the level of semigroups. We write for the non-local Jacobi operator with parameters , and , as in Lemma 3.2, which is in one-to-one correspondence with the Bernstein function defined in (3.2).
Lemma 3.13**.**
For any , the following identities hold on :
[TABLE]
in the cases and , respectively.
Proof.
It suffices to prove that and hold on , where we write and refer to (3.11) and the subsequent discussion for the definitions, as then the same arguments as in the proof of Proposition 3.6 will go through. In the case , we have, for any and using the recurrence relation of the gamma function,
[TABLE]
On the other hand, since and ,
[TABLE]
which proves the claim in this case. Finally,
[TABLE]
while on the other hand, using the definition of in (3.2),
[TABLE]
which, by linearity, completes the proof of the lemma. ∎
The following result lifts the intertwinings of the Propositions 3.6 and 3.13 to the level of semigroups. We here write to emphasize the one-to-one correspondence, for fixed , between and .
Proposition 3.14**.**
For all and , the following identities hold for all on the appropriate -spaces,
[TABLE]
with the latter two holding for and , respectively.
We need an auxiliary result concerning the corresponding intertwining operators, which extends their boundedness from to the corresponding weighted Hilbert spaces. For two Banach spaces and , we denote by the space of bounded linear operators from to .
Lemma 3.15**.**
For all , and , the operators , and belong to , and , respectively, and have operator norm 1.
Proof.
Let and . Then, applying Jensen’s inequality to the Markov multiplicative kernel together with Lemma 3.4 gives
[TABLE]
where we used that . Since is a probability measure on the compact set , it follows that is a dense subset of ; see e.g. [22, Corollary 22.10]. So by density, we conclude that is in with operator norm less than or equal to 1. Equality then follows from . The case is a straightforward consequence of being a Markov multiplicative kernel, and the claims regarding the other operators are deduced similarly from Lemma 3.4. ∎
Next, since and are generators of -Markov semigroups, it follows that their resolvent operators, given for , by
[TABLE]
are bounded, linear operators on . We write (resp. ) for the resolvent corresponding to (resp. ).
Lemma 3.16**.**
For all , and , we have the following identities on :
[TABLE]
where the second one holds for and the last one for .
Proof.
We shall only provide the proof of the first claim, which relies on the intertwining in Proposition 3.6, as the other claims follow by invoking Proposition 3.13 and involve the same arguments, mutatis mutandis. First, suppose that and and let so that there exists such that . Applying to both sides of this equality gives
[TABLE]
where in the third equality we have used Proposition 3.6, which is justified as . This equality may be rewritten as and consequently, for any , we get
[TABLE]
Thus it remains to show the inclusions and for which we recall, from the proof of Proposition 3.6, that with , for any . A straightforward computation gives that and hence
[TABLE]
from which it follows, by the injectivity of on , that
[TABLE]
Rearranging the above yields the equation
[TABLE]
which is justified as, for any , both roots of the quadratic equation are always negative. Note that so by iteratively using the equality in (3.23), we conclude that, for any , , and by linearity follows. Similar arguments applied to then allow us to also conclude that , which completes the proof. ∎
Proof of Proposition 3.14.
We are now able to complete the proof of Proposition 3.14. As was shown in the proof of Proposition 3.16 above and using the notation therein, and , so that on we have
[TABLE]
and, by induction, for any ,
[TABLE]
In particular, for any and ,
[TABLE]
Now, taking the strong limit in as of the above yields, by the exponential formula [44, Theorem 8.3] and the continuity of the involved operators guaranteed by Lemma 3.3, for any and ,
[TABLE]
where is the classical Jacobi semigroup on with parameters and . By density of in and since Lemma 3.15 with gives , it follows that the identity in (3.24) extends to , which completes the proof of the first identity. The other two identities follow from similar arguments and so the proof is omitted. ∎
For , we define, for , the quantity as
[TABLE]
where the first equality comes from some straightforward algebra given the definition of in (A.5). Note that, with we get , for all . We shall need the following result.
Lemma 3.17**.**
For any the mapping is strictly increasing on with
[TABLE]
Proof.
Using the definition in (3.25) we get that
[TABLE]
Since each term in the product is strictly greater than 1 and together with Stirling’s formula for the gamma function this completes the proof. ∎
Now, we write , and for the Hilbertian adjoints of the operators , and , respectively.
Proposition 3.18**.**
Let and . Then, for all ,
[TABLE]
and with as in (3.4), the sequence is a complete Bessel sequence in with Bessel bound 1. Furthermore, for any , we have,
[TABLE]
while
[TABLE]
and is the unique biorthogonal sequence to in , which is equivalent to being the unique -solution to for any . In all cases is a complete Bessel sequence in with Bessel bound 1.
Remark 3.19**.**
Note that Proposition 3.18 yields -norm bounds for the functions and . Indeed, writing for the -norm we get, from the boundedness claims of Lemma 3.15, for all and ,
[TABLE]
where and we used Lemma 3.17 for the two estimates. We show in the proof below that
[TABLE]
and since , invoking again Lemma 3.17, we have that the above ratio grows with .
Proof.
By (2.7), (3.7), (A.4) and linearity of , one obtains
[TABLE]
Recall that the sequence forms an orthonormal basis of and thus, as the image under a bounded operator of an orthonormal basis, we get that is a Bessel sequence in with Bessel bound given by the operator norm of , which by Lemma 3.15, is 1. If , we have , so that the first claim is proved in this case. In the case we suppose, without loss of generality, that and . Then reduces to
[TABLE]
and one gets from (2.7), (3.7), (A.4) and linearity of
[TABLE]
By Lemma 3.15, belongs to with operator norm 1 and thus, by similar arguments as above, we deduce that is a Bessel sequence in with Bessel bound .
We continue with the claims regarding , starting with the case . A simple calculation shows that for ,
[TABLE]
where with and the density of the random variable , whose existence is provided in Lemma 3.3. Thus it suffices to show that, for all ,
[TABLE]
To do that, we note that for ,
[TABLE]
and
[TABLE]
So invoking the uniqueness claim of Proposition 3.11 yields the representation (3.28) for . The case follows by similar arguments, albeit with more tedious algebra, and its proof is omitted.
Next, we note that for ,
[TABLE]
As is an orthonormal sequence in , we have for all ,
[TABLE]
and thus we get that is a biorthogonal sequence in of . It follows from Lemma 3.15 that belongs to and has operator norm . So, since forms an orthonormal basis of , one obtains that is a Bessel sequence in with Bessel bound 1.
To show uniqueness, we first observe that any sequence biorthogonal to must satisfy
[TABLE]
which means that is biorthogonal to . However, since is an orthonormal basis for , one must have , and so, . Since by Lemma 3.3, is dense in , one has , and we conclude that is the unique sequence in biorthogonal to .
Finally, assume now that . Then, using the definition of in (3.2) and Lemma 3.2.(iii), we get that
[TABLE]
On the other hand, since , see (3.7), simple algebra yields that
[TABLE]
We know that, since , is the unique sequence biorthogonal to . Combining this with (3.31) gives
[TABLE]
which shows that is biorthogonal to . Uniqueness follows as above.
Finally, the completeness of is a consequence of the fact that it is, in all cases and by Lemmas 3.3 and 3.15, the image under a bounded linear operator with dense range of the sequence , which is itself is complete. ∎
Proof of Theorem 2.4.
We tackle the different claims of Theorem 2.4 sequentially. Setting in (2.15) we get, by the first intertwining in Proposition 3.14, (3.27) and the spectral expansion of the self-adjoint semigroup in (A.8), that for any and ,
[TABLE]
where the second identity is justified by and the fact that is a Bessel sequence in , see [16, Theorem 3.1.3], and the last identity uses the fact that, by Proposition 3.18, is the unique -solution to the equation . It follows that is an eigenfunction for with eigenvalue . Taking the adjoint of the first identity in Proposition 3.14 and using the self-adjointness of on yields and thus, for any and ,
[TABLE]
Since , this shows that .
Next, let be the linear operator on defined by
[TABLE]
so that, by the above observations,
[TABLE]
For convenience, we set , . Then, for any and we have, for a constant independent of ,
[TABLE]
where the first inequality follows from the asymptotic in (3.26) combined with the decay of the sequence , , and the second inequality follows from the Bessel property of guaranteed by Proposition 3.18. Hence we deduce that and, as is a Bessel sequence, it follows that defines a bounded linear operator on for any , again by [16, Theorem 3.1.3]. However, on , a dense subset of . Therefore, by the bounded linear extension theorem, we have on for any . Note that, by similar Bessel sequence arguments as above, for any ,
[TABLE]
Since the supremum on the right-hand side is decreasing in for any , we get
[TABLE]
in the operator norm topology, where each is of finite rank. This completes the proof of (i) of Theorem 2.4 and also shows that is a compact operator for any , which is claim (ii).
Next, the intertwining identity (3.21) and the completeness of and enable us to invoke [41, Proposition 11.4] to obtain the equalities for algebraic and geometric multiplicities in (iii), and also to conclude that
[TABLE]
Since is compact, is so too, and thus as well as . To establish the remaining equalities we use the immediate compactness of to invoke [24, Corollary 3.12] and obtain , while we also have from [24, Theorem 3.7] that, . Putting all of these together completes the proof of (iii).
It remains to prove the last item concerning the self-adjointness of . Clearly if then is self-adjoint, as in this case reduces to and reduces to the classical Jacobi semigroup , which is self-adjoint on . Now suppose that is self-adjoint on , that is for all . By differentiating in the identity, for any ,
[TABLE]
we deduce, by a simple application of Fubini’s Theorem using the finiteness of the measure , that
[TABLE]
Note that (3.32) holds trivially if either or , or if , so we may suppose that ; all together we take, without loss of generality, . Now, for any , a straightforward calculation shows that
[TABLE]
where we recall from (2.3) that and from (2.11) that . Using (3.33) on both sides of (3.32) and rearranging gives
[TABLE]
By (2.6) and the recurrence relations for and the gamma function, the ratio evaluates to
[TABLE]
so that substituting into (3.34) shows that the following must be satisfied
[TABLE]
Next, we write as
[TABLE]
where the first equality is simply the definition of in (2.1) and the second follows from the assumption that . Let us write and . By direct verification we get
[TABLE]
so that (3.35) is equivalent to
[TABLE]
Observe that
[TABLE]
while
[TABLE]
Hence canceling on both sides of (3.36), then dividing by and rearranging the resulting equation yields
[TABLE]
Applying the dominated convergence theorem when taking the limit as of the right-hand side we find that, for all with ,
[TABLE]
which implies that . This completes the proof of claim (iv).∎
To conclude this section we give a result concerning the intertwining operators in Proposition 3.14 which illustrates that, except in the self-adjoint case of and , none of these operators admit bounded inverses. This latter fact combined with the relation (3.30) imply that is a not a Riesz sequence in , as it is not the image of an orthogonal sequence by an invertible bounded operator, see [16]. Recall that a quasi-affinity is a linear operator between two Banach spaces with trivial kernel and dense range.
Proposition 3.20**.**
Let and .
- (i)
Then, the operators , and are quasi-affinities. 2. (ii)
If , the operator admits a bounded inverse if and only if and . In all cases, and do not admit bounded inverses.
Proof.
Since polynomials belong to the -range of the operators , , and , we get, by moment determinacy, that each of these has dense range in their respective codomains. For the remaining claims we proceed sequentially by considering each operator individually, starting with . Proposition 3.18 gives that, for any ,
[TABLE]
and also that and are biorthogonal. Consequently,
[TABLE]
However, as forms an orthonormal basis for it must be its own unique biorthogonal sequence, which forces
[TABLE]
for all . Thus we conclude that , so that by moment determinacy of , we get that . Next, by straightforward computation we have, for any ,
[TABLE]
where the second equality follows by using the definition of , see (3.5), together with the recurrence relation for . Now, the same arguments as in the proof of [41, Theorem 7.1(2)] may be applied, see e.g. Section 7.3 therein, to get that the ratio in (3.37) tends to 0 as if and only if and . This is because, with the notation of the aforementioned paper, the expression for is equal to in our notation, and we have from . From the definition of in (3.5) we find that, if , then and from Lemma 3.2.(iii) we get that if while is always zero when , which shows that if then . On the other hand, from (3.5), it is clear that if then always . This completes the proof of the claims regarding . Next, by Proposition 3.18, , for each , and as proved in Proposition 3.14, the sequence is complete. Thus is dense in , or equivalently . By direct calculation we get that,
[TABLE]
where was defined in (3.6). Now the fact that allows us to deduce and, as noted earlier, is a Bernstein function and hence non-decreasing. As the case is excluded by the assumption on , we get that, as , the ratio in (3.38) tends to 0. Next, by taking the adjoint of (3.21) we get
[TABLE]
and using this identity we get that is an eigenfunction for associated to the eigenvalue . Then, Theorem 2.4.(iii) forces , and the latter is a complete sequence, whence . Finally, another straightforward calculation gives that
[TABLE]
and using the fact that we conclude that the right-hand side tends to 0 as . ∎
3.6. Proof of Theorem 2.5(i)
Theorem 2.4 gives, for any and ,
[TABLE]
so that, since and ,
[TABLE]
Next, we note that
[TABLE]
since
[TABLE]
and , which is trivial when , as then , while if we have from [41, Proposition 4.4(1)]. Now, we claim that the following computation is valid, writing again for the -norm and ,
[TABLE]
To justify this we start by observing that the first inequality follows from (3.39) together with being a Bessel sequence with Bessel bound 1, which was proved in Proposition 3.18. Next we use the fact that is an eigenfunction for associated to the eigenvalue , and then the identity
[TABLE]
which follows by considering the cases and separately. Indeed, when then and , while otherwise so that and . The second inequality follows from (3.40) and then we use the biorthogonality of and , given by Proposition 3.18, which implies that for any , if . The last inequality follows from the fact that is a Bessel sequence with Bessel bound 1, again due to Proposition 3.18. Next, if and since , we get
[TABLE]
so that the contractivity of the semigroup yields, for and any ,
[TABLE]
Finally, since is an invariant probability measure,
[TABLE]
which completes the proof. ∎
3.7. Proof of Theorem 2.5.(ii)
We first give a result that strengthens the intertwining relations in Proposition 3.14 and falls into the framework of the work by Miclo and Patie [35]. Write for the Markov multiplicative kernel associated to a random variable with law , which, by the same arguments as in the proof of Lemma 3.15, satisfies . We write and, for , let and otherwise let . Recall that a function is said to be completely monotone if and , for and . By Bernstein’s theorem, any completely monotone function is the Laplace transform of a positive measure on , and if (resp. ) then is the Laplace transform of finite (resp. probability) measure on , see e.g. [49, Chapter 1].
Proposition 3.21**.**
Under the assumptions of the theorem, we have an interweaving relationship between and , in the sense of [35], that is for and on the respective -spaces
[TABLE]
where is a Bernstein function with being the completely monotone function given by
[TABLE]
Proof.
We give the proof only in the case , so that , as the other case follows by similar arguments. From Proposition 3.14 we get, with ,
[TABLE]
and taking the adjoint and using that both and are self-adjoint on and , respectively, we get that
[TABLE]
Combining this with the first intertwining relation in Proposition 3.14 then yields
[TABLE]
and, together with second intertwining relation in Proposition3.6, we conclude that
[TABLE]
As is self-adjoint with simple spectrum the commutation identity (3.42) implies, by the Borel functional calculus, see e.g. [47], that for some bounded Borel function , and to identify it suffices to identify the spectrum of . To this end we observe that, for any ,
[TABLE]
where we used that forms an orthonormal basis for and the identity follows by a straightforward, albeit tedious, computation. Consequently, for any ,
[TABLE]
where the second and third equalities follow from calculations that were detailed in the proof of Proposition 3.18. Using the definition of in (3.25) we thus get that, for ,
[TABLE]
recalling from (2.11) that are the eigenvalues of , which proves that . Next, one readily computes that the non-negative inverse of the mapping is given by the function defined prior to the statement of the theorem, which was remarked to be a Bernstein function. For another short proof of this fact, observe that, for ,
[TABLE]
which is completely monotone. Since is the Laplace transform of the product convolution of the beta distributions and we may invoke [49, Theorem 3.7] to conclude is completely monotone. Finally, to show that is a Bernstein function we note that, for any , the function is a Bernstein function, see e.g. Example 88 in [49, Chapter 16]. Since
[TABLE]
with , and the composition of Bernstein functions remains Bernstein together with the fact that the set of Bernstein functions is a convex cone, see e.g. [49, Corollary 3.8] for both of these claims, it follows that is a Bernstein function. ∎
Proof of Theorem 2.5.(ii).
Since we may apply Proposition 3.21 to conclude that and a straightforward substitution gives , , with a Bernstein function. From the Borel functional calculus we get, since is self-adjoint on , that
[TABLE]
Combining this identity with (3.41) yields, for non-negative ,
[TABLE]
and the general case follows by linearity and by decomposing into the difference of non-negative functions. By Proposition 3.20, has trivial kernel on . So we deduce
[TABLE]
and thus satisfies an interweaving relation with , in the sense of [35]. Consequently we may invoke [35, Theorems 7, 24] to transfer the entropy decay and -entropy decay of , reviewed in Appendix A, to the semigroup but after a time shift of the independent random variable . Note that, when , we may take so that the reference semigroup is , which has optimal entropy decay rate. ∎
3.8. Proof of Theorem 2.9
The proof of Theorem 2.9(i) follows by using Equation 3.43 above to invoke [35, Theorem 8]. Next, by Equation 3.43 and using Proposition 3.21, we get
[TABLE]
where the last inequality follows by applying Lemma 3.15 twice, once in the case for and once with for . The claim now follows from the corresponding ultracontractivity estimate for . ∎
3.9. Proof of Theorem 2.10
The following arguments are taken from the proof of [34, Proposition 5]. We denote by for the classical Jacobi semigroup subordinated with respect to . By [35, Theorem 3] we obtain, from Proposition 3.21, an interweaving relationship between the subordinate semigroups, i.e. writing and as above, we have, for any and on the appropriate -spaces,
[TABLE]
Using this we get, for any and ,
[TABLE]
where in the second equality we used the boundedness of together the expansion for the subordinated classical Jacobi semigroup which follows from (A.8) and standard arguments, then the properties of and detailed in previous sections, and finally the expression for in (2.16). All of the claims, save for the last one, then follow from [35, Theorems 7, 24] applied to (3.44). Next, we establish the ultracontractive bound for . From (2.16) we get, by applying Stirling’s formula for the gamma function together with , that . Writing for convenience , we get by assumption on the parameters that so that the previous asymptotic yields, for ,
[TABLE]
where the two equalities follow by applying Tonelli’s theorem together with a change of variables, and the inequality follows from the fact that, for all , is non-increasing, recalling the notation . Hence, from the ultracontractive bound , valid for all , we deduce that for
[TABLE]
Consequently from (3.44) we get that, for ,
[TABLE]
Then it is easy to complete the proof of the last claim by following similar arguments as in the proof of [5, Proposition 6.3.4], noting that the required variance decay estimate therein, namely
[TABLE]
valid for all and , follows trivially from Theorem 2.5.(i) via subordination. ∎
4. Examples
In this section we consider a parametric family of non-local Jacobi operators for which is a power function. More specifically, let and consider the integro-differential operator given by
[TABLE]
Then is a non-local Jacobi operator with and , , or one easily gets that equivalently , . One readily computes that and thus the condition is always satisfied, which implies that . Writing for the Bernstein function in one-to-one correspondence with , we have that for ,
[TABLE]
From the right-hand side of (4.1) we easily see that . Now, we assume that and, for sake of simplicity, take . The following result characterizes all the spectral objects for these non-local Jacobi operators.
Proposition 4.1**.**
**
- (i)
The density of the unique invariant measure of the Markov semigroup associated to is given by
[TABLE]
where is the density of a Beta random variable of parameter and , see (A.2). 2. (ii)
We have that and, for ,
[TABLE]
*where, on the right-hand side, we made explicit the dependence on the two parameters for the classical Jacobi polynomials, i.e. , see (A.4), and where . * 3. (iii)
For any the function is given by
[TABLE]
where has the so-called Barnes integral representation, see e.g. **[13]**, for any ,
[TABLE]
and .
Proof.
First, from (4.1) and (2.4) we get that, for any ,
[TABLE]
so that from (2.6) we deduce that
[TABLE]
The first term on the right of (4.3) is the -moment of the probability density on while the second term is the -moment of a density. Thus, by moment identification and determinacy, we conclude that and after some easy algebra we get, for , that
[TABLE]
which completes the proof of the first item. Next, substituting (4.2) in (2.7), gives , and for ,
[TABLE]
where, to compute the second equality we made a change of variables and used the recurrence relation of the gamma function, and the definition of the classical Jacobi polynomials, see Section A and also [50]. This completes the proof of (ii). To prove (iii) we recall from (2.10) that, for any , , where, by (3.16), the Mellin transform of is given, for any , as
[TABLE]
used twice the functional equation for the gamma function and the definition of the constant in the statement. Next, writing for any and , we recall from (3.19) that there exists a constant such that
[TABLE]
where we recall that and . Hence, since is analytic on the right half-plane, by Mellin’s inversion formula, see e.g. [37, Chapter 11], one gets for any ,
[TABLE]
where the integral is absolutely convergent for any . Note that this is a Barnes-integral since we can write, again using the functional equation for the gamma function,
[TABLE]
see for instance [13]. Next, since , it follows that the function does not have any poles, while the function has simple poles at for all . Consequently, by Cauchy’s residue theorem we have, for any ,
[TABLE]
where we used that the integrals along the two horizontal segments of any closed contour vanish, as by (4.4) they go to [math] when . We justify the radius of convergence of the series as follows. Since , using Euler’s reflection formula for the gamma function, i.e. , , we conclude that
[TABLE]
where we used that for . Using the recurrence relation of the gamma function we deduce that the radius of convergence of this series is , which completes the proof. ∎
Appendix A Classical Jacobi operator and semigroup
A.1. Introduction and boundary classification
Before we begin reviewing the classical Jacobi operator, semigroup and process, we clarify the notational conventions that we use for these objects throughout the paper. Namely, with being fixed, instead of writing we suppress the dependency on and simply write , and similarly for the beta distribution, Jacobi semigroup and polynomials. The exception is when any of these objects depend in a not-straightforward way on , in which case we highlight the dependency explicitly. Now, fix constants and let be the classical Jacobi semigroup whose càdlàg realization is the Jacobi process with values in , that is, for bounded measurable functions ,
[TABLE]
Then is a Feller semigroup with infinitesimal generator , given, for , by
[TABLE]
Note that if the state space of the Jacobi process is taken to be , then the associated infinitesmal generator is
[TABLE]
and setting yields
[TABLE]
Since the operator is degenerate at the boundaries of , it is important to specify how the process behaves at these points. After some straightforward computations, as outlined in [11, Chapter 2] and using the notation therein, we get that the boundaries are classified as follows
[TABLE]
Therefore, our assumptions Assumption A on and guarantee that both [math] and are at least entrance, and may be regular or entrance-not-exit depending on the particular values of and . Let us write for the domain of the generator of the Feller semigroup. To specify it we recall that the so-called scale function of satisfies
[TABLE]
Let and denote the right and left derivatives of a function with respect to , i.e.
[TABLE]
Then,
[TABLE]
In particular, , since for any we have
[TABLE]
From the boundary conditions in (A.1) we get that if any point in is regular then it is necessarily a reflecting boundary for the Jacobi process with .
A.2. Invariant measure and -properties
The classical Jacobi semigroup has a unique invariant measure , which is the following beta distribution on :
[TABLE]
and we recall that, for any ,
[TABLE]
Since is invariant for , we get that extends to a contraction semigroup on and, moreover, the stochastic continuity of ensures that this extension is strongly continuous in . Consequently, we obtain a Markov semigroup on , which we still denote by . The eigenfunctions of are the Jacobi polynomials given, for any and , by
[TABLE]
where we denote
[TABLE]
These polynomials are orthogonal with respect to the measure and, by choice of , satisfy the normalization condition
[TABLE]
In particular, they form an orthonormal basis of . They have the alternative representation
[TABLE]
where are the Rodrigues operators given in (2.8) and
[TABLE]
All these relations follow by the change of variables and simple algebra from the corresponding relations for the polynomials defined in [32, Chap. 4], which are orthogonal with respect to the weight , and are also called Jacobi polynomials in the literature. Indeed, and are related through
[TABLE]
Next, the eigenvalue associated to the eigenfunction is, for ,
[TABLE]
Observe that for , (A.7) reduces to and , so that denotes the largest, non-zero eigenvalue of , which is also called the spectral gap. The semigroup then admits the spectral decomposition given, for any and , by
[TABLE]
where the sum converges in the operator norm. The domain of , the generator of the Markov semigroup, in can then be identified as
[TABLE]
A.3. Variance and entropy decay, hypercontractivity and ultracontractivity
As mentioned in the introduction, the fact that has nice spectral properties and satisfies certain functional inequalities gives quantitative rates of convergence to the equilibrium measure . For instance, from (A.8) one gets the following variance decay estimate, valid for any and ,
[TABLE]
which may also be deduced directly from the Poincaré inequality for , see [5, Chapter 4.2]. This convergence is optimal in the sense that the decay rate does not hold for any constant greater than . Next, let us write for the log-Sobolev constant of defined as
[TABLE]
Once always has , and in the case of the symmetric Jacobi operator, i.e. , one gets
[TABLE]
while otherwise , see e.g. [26], although the equality for the symmetric case goes back to [45]. As a consequence of (A.9) we have on the one hand, the convergence in entropy for any and such that ,
[TABLE]
and on the other hand, from Gross [30], the hypercontractivity estimate; that is, for all ,
[TABLE]
From (A.10) we thus get that the symmetric Jacobi semigroup attains the optimal entropic decay and hypercontractivity rate. Moreover, if , there exists a homeomorphism between and the radial part of the Laplace–Beltrami operator on the -sphere, which leads to the curvature-dimension condition ; see [5]. Thus for any function satisfying the admissibility condition (2.12), one has
[TABLE]
for any and such that . If , the operator also satisfies a Sobolev inequality, see e.g. [4], and thus one obtains from [5, Theorem 6.3.1] that, for ,
[TABLE]
where is the Sobolev constant for of exponent , given by
[TABLE]
The fact that is a contraction on together with the above ultracontractive bound yields for any . Finally, we mention that , and upper and lower bounds are known in the general case; see again [4].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Achleitner, A. Arnold, and E. A. Carlen. On multi-dimensional hypocoercive BGK models. Kinet. Relat. Models , 11(4):953–1009, 2018.
- 2[2] C. Ané, S. Blachère, D. Chafaï, P. Fougères, I. Gentil, F. Malrieu, C. Roberto, and G. Scheffer. Sur les Inégalités de Sobolev Logarithmiques . Société Mathématique de France, 2000.
- 3[3] D. Bakry. Étude des transformations de Riesz dans les variétés riemanniennes à courbure de Ricci minorée. In Séminaire de Probabilités, XXI , volume 1247 of Lecture Notes in Math. , pages 137–172. Springer, Berlin, 1987.
- 4[4] D. Bakry. Remarques sur les semigroupes de Jacobi. Astérisque , (236):23–39, 1996. Hommage à P. A. Meyer et J. Neveu.
- 5[5] D. Bakry, I. Gentil, and M. Ledoux. Analysis and Geometry of Markov Diffusion Operators , Volume 348 of Grundlehren der Mathematischen Wissenschaften . Springer, Cham, 2014.
- 6[6] F. Baudoin. Bakry–Émery meet Villani. J. Funct. Anal. , 273(7):2275–2291, 2017.
- 7[7] C. Berg and A. J. Durán. A transformation from Hausdorff to Stieltjes moment sequences. Ark. Mat. , 42(2):239–257, 2004.
- 8[8] J. Bertoin. Subordinators: Examples and Applications. In Lectures on Probability Theory and Statistics (Saint-Flour, 1997) , volume 1717 of Lecture Notes in Math. , pages 1–91. Springer, Berlin, 1999.
