Analytic Evaluation of the Fractional Moments for the Quasi-Stationary Distribution of the Shiryaev Martingale on an Interval
Kexuan Li, Aleksey S. Polunchenko, Andrey Pepelyshev

TL;DR
This paper derives a closed-form formula for the fractional moments of the quasi-stationary distribution of the Shiryaev diffusion on an interval, revealing its asymptotic behavior as the interval length grows.
Contribution
It provides the first analytical expression for fractional moments of the distribution for any real order, extending previous results for integer moments.
Findings
Fractional moments are explicitly derived for all real orders.
As A approaches infinity, the fractional moments converge to those of an exponential distribution.
The limiting distribution as A increases is the stationary distribution of the reciprocal process.
Abstract
We consider the quasi-stationary distribution of the classical Shiryaev diffusion restricted to the interval with absorption at a fixed . We derive analytically a closed-form formula for the distribution's fractional moment of an {\em arbitrary} given order ; the formula is consistent with that previously found by Polunchenko and Pepelyshev (2018) for the case of . We also show by virtue of the formula that, if , then the -th fractional moment of the quasi-stationary distribution becomes that of the exponential distribution (with mean ) in the limit as ; the limiting exponential distribution is the stationary distribution of the reciprocal of the Shiryaev diffusion.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Bayesian Methods and Mixture Models · Financial Risk and Volatility Modeling
ANALYTIC EVALUATION OF THE FRACTIONAL MOMENTS FOR THE QUASI-STATIONARY DISTRIBUTION OF THE SHIRYAEV MARTINGALE ON AN INTERVAL
Kexuan Li
Aleksey S. Polunchenko
Department of Mathematical Sciences, State University of New York (SUNY) at Binghamton
Binghamtom, NY 13902–6000, USA
Andrey Pepelyshev
School of Mathematics, Cardiff University, Cardiff, CF24 4AG, UK
and
St. Petersburg State University, St. Petersburg, Russia, 199034
Abstract
We consider the quasi-stationary distribution of the classical Shiryaev diffusion restricted to the interval with absorption at a fixed . We derive analytically a closed-form formula for the distribution’s fractional moment of an arbitrary given order ; the formula is consistent with that previously found by Polunchenko and Pepelyshev (2018) for the case of . We also show by virtue of the formula that, if , then the -th fractional moment of the quasi-stationary distribution becomes that of the exponential distribution (with mean ) in the limit as ; the limiting exponential distribution is the stationary distribution of the reciprocal of the Shiryaev diffusion.
keywords:
Fractional moments, Markov diffusions, Quasi-stationarity, Shiryaev process, Stochastic processes
††journal: Communications in Statistics—Simulation and Computation
url]http://www2.math.binghamton.edu/p/people/grads/kli/start
url]http://people.math.binghamton.edu/aleksey
1 Introduction
This paper is an extension of the recent work of Polunchenko and Pepelyshev (2018) devoted to explicit analytic evaluation of
(a) the -th moment, with ; and (b) the Laplace transform of the quasi-stationary distribution of the classical Shiryaev diffusion restricted to an interval; the Shiryaev process in question and its quasi-stationary distribution are formally introduced below. The principal contribution of the present work is a closed-form formula for the quasi-stationary distribution’s fractional moment of any given order . Specifically, we consider the Shiryaev process defined as the solution of the stochastic differential equation
[TABLE]
where is standard Brownian motion in the sense that , , and ; the initial value is often referred to as the process’ headstart. The time-homogeneous Markov process is an important particular version of the so-called generalized Shiryaev process. The latter has been first arrived at and studied by Prof. A.N. Shiryaev—hence, the name—in his fundamental work on quickest change-point detection where the process is typically referred to as the Shiryaev–Roberts process; see, e.g., Shiryaev (1961, 1963). While the interest to the Shiryaev process in the context of quickest change-point detection has always been strong (see, e.g., Pollak and Siegmund 1985, Shiryaev 2002, Feinberg and Shiryaev 2006, Burnaev et al. 2009, Polunchenko 2016, 2017a, 2017b, 2017c), the process has received a great deal of attention in other areas as well, notably mathematical finance (see, e.g., Geman and Yor 1993, Donati-Martin et al. 2001, Linetsky 2004) and mathematical physics (see, e.g., Monthus and Comtet 1994, Comtet and Monthus 1996). It has also been studied in the literature on general stochastic processes (see, e.g., Wong 1964, Yor 1992, Donati-Martin et al. 2001, Dufresne 2001, Schröder 2003, Peskir 2006, Polunchenko and Sokolov 2016, Polunchenko et al. 2018).
The particular version of the generalized Shiryaev process governed by equation (1) is of special importance and interest because it is the only version with probabilistically nontrivial behavior in the limit as . This behavior is manifested through a certain invariant probability measure attained by the process (in the limit as ) in spite of the distinct martingale property for all and . Moreover, the process is convergent (as ) regardless of whether the state space is
(I) the entire half-line with no absorption on the interior; or (II) the interval with absorption at a given level ; or (III) the shortened half-line also with absorption at given. The case of a negative initial value was touched upon by Peskir (2006) who, in particular, showed that, if , then has no invariant probability measure on the negative half-line, and is simply bound (with unit probability) to eventually exit the region through the upper (exit) boundary. Cases (I), (II), and (III) have all been considered in the literature. Brief surveys of the corresponding results and relevant references were recently offered, e.g., by Polunchenko et al. (2018) and by Polunchenko and Pepelyshev (2018).
This work’s focus is on case (II). This case is of importance in quickest change-point detection, and in this context, it was investigated, e.g., by Pollak and Siegmund (1985), Burnaev et al. (2009) and by Polunchenko (2017a). See also, e.g., Pollak and Siegmund (1986), Linetsky (2004) and (Collet et al., 2013, Section 7.8.2). For a fixed and a given , the distribution of in this case is conditional on no absorption prior to . This distribution was derived explicitly, e.g., by Linetsky (2004) and by Polunchenko (2016). The limit of this distribution as is known as the quasi-stationary distribution. Formally, consider the stopping time
[TABLE]
where and are fixed; note that . The quasi-stationary distribution of is defined as
[TABLE]
and it does not depend on . The existence of this distribution was formally established, e.g., by Pollak and Siegmund (1985), although one can also infer the same conclusion, e.g., from the seminal work of Mandl (1961). Moreover, analytic closed-form formulae for and were recently obtained by Polunchenko (2017a), apparently for the first time in the literature; see formulae (11) and (12) in Section 3 below.
If , then case (II) becomes case (III), but the process still has a nontrivial probabilistic behavior in the limit as . This behavior is characterized by the invariant probability measure known as the stationary distribution. Formally, for the Shiryaev process (1) the latter is defined as
[TABLE]
and it, too, is independent of . This distribution has already been found, e.g., by Shiryaev (1961, 1963), Pollak and Siegmund (1985), Feinberg and Shiryaev (2006), Burnaev et al. (2009), and Polunchenko and Sokolov (2016), to be the momentless (no moments of orders one and higher) distribution
[TABLE]
which is an extreme-value Fréchet-type distribution, and a particular case of the inverse (reciprocal) Gamma distribution. See also, e.g., Linetsky (2004) and Avram et al. (2013). It is immediate to see from (4) that the stationary distribution of the reciprocal of the Shiryaev process is exponential with mean . The fact that converges to as was established, e.g., by Pollak and Siegmund (1985, 1986); the convergence is from above, and is pointwise, at every , i.e., at all continuity points of . The -to- convergence was recently investigated further by Li and Polunchenko (2019) who used the explicit formula obtained by Polunchenko (2017a) for to demonstrate that the uniform (in ) rate of convergence of down to as is no slower than .
While the stationary distribution (3)–(4) is momentless (no moments of orders one and higher, but moments of orders do exist), the quasi-stationary distribution (2) is not: its entire (positive-integer-order) moment series was recently evaluated explicitly by Polunchenko and Pepelyshev (2018) through the use of the closed-form formulae for and obtained earlier by Polunchenko (2017a). The contribution of this work is an explicit formula (in different equivalent forms) for the quasi-stationary distribution’s fractional moment of any given order ; the existence of the -th order fractional moment is shown formally as well. All this is done in Section 4, the core part of the paper. Since the -th fractional moment formula and its derivation involve certain special functions, the centerpiece of the paper is prefaced with Section 2 which introduces the relevant special functions. Section 3 conveniently summarizes the essential earlier results on the quasi-stationary distribution (2) of the Shiryaev process (1). Lastly, concluding remarks are provided in Section 5.
2 Notation and nomenclature
For convenience, standard mathematical notation will be used throughout the sequel. This includes the small array of special functions we shall deal with in the sections to follow. These functions, in their most common notation, are:
The Gamma function , , frequently also referred to as the extension of the factorial to complex numbers, due to the property exhibited for . See, e.g., (Bateman and Erdélyi, 1953, Chapter 1). 2. 2.
The digamma function , , defined as the logarithmic derivative of the Gamma function:
[TABLE]
see, e.g., (Bateman and Erdélyi, 1953, Section 1.7). 3. 3.
The Pochhammer symbol, or the rising factorial, often notated as and defined for and as
[TABLE]
and it is of note that for any . See, e.g., (Srivastava and Karlsson, 1985, pp. 16–18). Also, observe that
[TABLE]
and if is a negative integer or zero, i.e., if and , then
[TABLE]
cf. (Srivastava and Karlsson, 1985, p. 16–17). 4. 4.
The generalized hypergeometric function, denoted as , and defined via the power series
[TABLE]
where ; it is to be understood here that the argument and the numerator and denominator parameters, and , are such that the series on the right of (8) is convergent. At the very least, in view of (7), no , , should be a negative integer or zero. For more details on the convergence conditions, see, e.g., (Bateman and Erdélyi, 1953, Chapter 4) and (Srivastava and Karlsson, 1985, p. 20). The convergence question is trivial when one (or more) of the numeratorial parameters, say for some , is a negative integer or zero: in this case, in view of (7), the power series on the right of (8) terminates turning the function into a polynomial in of degree . Two special cases of the function will arise in the sequel: one with and one with . The function corresponding to the former case is known as the Kummer (1836) function. For more information on the Kummer function and its properties, see, e.g., Slater (1960) and (Buchholz, 1969, Chapter I, Section 1). 5. 5.
The Whittaker and functions, traditionally denoted, respectively, as and , where ; the Whittaker function is undefined when , but can be regularized. These functions were introduced by Whittaker (1904) as the fundamental solutions to the Whittaker differential equation. See, e.g., Slater (1960) and Buchholz (1969).
3 Preliminary background on the quasi-stationary distribution
As was mentioned in the introduction, the quasi-stationary distribution defined in (2) was recently expressed analytically by Polunchenko (2016); the expression involves the Whittaker function. Specifically, it can be deduced from (Polunchenko, 2016, Theorem 3.1) that if is fixed and is the smallest (positive) solution of the equation
[TABLE]
where
[TABLE]
then the quasi-stationary probability density function (pdf) is given by
[TABLE]
and the respective cumulative distribution function (cdf) is given by
[TABLE]
and and are each a smooth function of and ; observe from (9), (10), and (11) that . The smoothness of and is due to certain analytic properties of the Whittaker function present on the right of formulae (11) and (12). These formulae stem from the solution of a certain Sturm–Liouville problem, and is the smallest positive eigenvalue of the corresponding Sturm–Liouville operator; it should be noted that Polunchenko (2016) considered the negated Sturm–Liouville operator, causing to be its largest negative eigenvalue.
Remark 3.1**.**
The definition (10) of can actually be changed to with no effect whatsoever on either equation (9), or formulae (11) and (12), i.e., all three are invariant with respect to the sign of . This was previously pointed out by Polunchenko (2017a), and the reason for this -symmetry is because equation (9) and formulae (11) and (12) each have present only as (double) the second index of the corresponding Whittaker function or functions involved, and the Whittaker function in general is known (see, e.g., Buchholz 1969, Identity (19), p. 19) to be an even function of its second index, i.e., .
It is evident that equation (9) is a key ingredient of formulae (11) and (12), and consequently, of all of the characteristics of the quasi-stationary distribution as well. As a transcendental equation, it can only be solved numerically, although to within any desired accuracy; numerical analyses of equation (9) were previously carried out, e.g., by Linetsky (2004) and by Polunchenko (2016, 2017a, 2017b). Yet, it is known (see, e.g., Linetsky 2004 and Polunchenko 2016) that for any fixed , equation (9) has countably many simple solutions , such that . All of them depend on , but since we are interested only in the smallest one, we shall use either the “short” notation , or the more explicit notation to emphasize the dependence on . Also, it can be concluded from (Polunchenko, 2016, p. 136 and Lemma 3.3) that is a monotonically decreasing function of such that . More specifically, certain properties of the moments of the quasi-stationary distribution (11)–(12) necessitate that
[TABLE]
cf. (Polunchenko, 2017a, p. 136) and (Polunchenko and Pepelyshev, 2018, p. 1360).
4 The fractional moment formulae
Let be a random variable sampled from the quasi-stationary distribution given by (11)–(12). Fix and let denote the -th fractional moment of . It is apparent that for any , but all other ’s actually do depend on , and, if necessary, we shall stress this dependence via the notation . The object of this section—the centerpiece of this work—is to find analytically for any given and .
The obvious first question to ask is that of whether the ’s actually do exist for each fixed and . For the answer is clearly “yes”. For the answer is also in the affirmative inasmuch as the definition of and the explicit formula (11) for together yield
[TABLE]
and the latter integral is convergent, for
[TABLE]
which is a special case of the more general asymptotic result
[TABLE]
established, e.g., in (Whittaker and Watson, 1927, Section 16.3).
If , then it can be inferred from (Polunchenko, 2017a, Theorem 3.2) that, for fixed, the series satisfies the recurrence
[TABLE]
with for any ; recall that and are interconnected via equation (9). This recurrence was recently solved explicitly by Polunchenko and Pepelyshev (2018).
Lemma 4.1** (Polunchenko and Pepelyshev 2018).**
For every fixed, the solution to the recurrence (14) is given by
[TABLE]
where is determined by (9) while is defined in (10); recall also that denotes the corresponding case of the generalized hypergeometric function (8).
It stands to mention that since in (15) is a nonnegative integer, the right hand side of (15) is actually a polynomial of degree in . See (Polunchenko and Pepelyshev, 2018, Formula (15), p. 1359). However, while it is essential for the validity of formula (15) that in it be a nonnegative integer, the recurrence (14) itself holds true even if is an arbitrary real number. This conclusion can be reached from a more careful analysis of the steps taken by Polunchenko (2017a) to arrive at a slightly more general version of the recurrence (14), plus the fact established above that the quasi-stationary distribution (11)–(12) has all fractional moments, of any real order. Thus the fractional moments follow the recurrence
[TABLE]
where for any , and our goal of finding explicitly becomes a matter of solving (16) analytically. To this end, even though (16) was obtained from (14) by (justifiably) replacing in the latter with , the same “trick” performed on the solution (15) to recurrence (14) will only yield a particular solution to (16), but not the complete solution. Moreover, formula (15) was arrived at by Polunchenko and Pepelyshev (2018) as a particular solution to (14) that just happens to satisfy the condition , so there was no need to find the general solution to the homogeneous version of (14). We shall now fill in this gap, i.e., make the obvious first step toward solving (16) completely.
Lemma 4.2**.**
For every fixed, the general solution to the recurrence (16) is given by
[TABLE]
where is an arbitrary constant (independent of but possibly dependent on ), and is determined by (9) while is defined in (10); recall also that denotes the corresponding case of the generalized hypergeometric function (8).
Proof.
The right-hand side of formula (17) has a fairly transparent structure: it is a superposition of a particular solution to (16) and the general solution to the homogeneous version of (16). The particular solution is given by the term on the right of (17), and it was already demonstrated explicitly by Polunchenko and Pepelyshev (2018) that it “works”. To see that the second term (proportional to ) on the right of (17) is, indeed, the general solution to the homogeneous version of (16), it suffices to iterate the corresponding first-order recurrence forward in , and then simplify the resulting expression using (10) and the well-known factorial property of the Gamma function, i.e., for any . ∎
The question now is how to “pin down” the constant on the right in (17). Given that (16) is an order-one nonhomogeneous difference equation, the question would be trivial if we had computed as a function of for at least one particular . To this end, setting seems the obvious first choice, because for any . However, setting in the general solution (17) results in for any . The reason is because the denominator of the second term on the right of (17) is which has a (simple) pole at , so that, no matter what is, the entire second term is nullified, reducing (17) down to (15). This circumstance renders the condition practically useless for the purposes of finding . As a matter of fact, the same happens for any , inasmuch as the Gamma function is known to have (simple) poles at all nonnegative-integer values of the argument. This explains why Polunchenko and Pepelyshev (2018), in their derivation of solution (15) to recurrence (14), never had to find the general solution to the homogeneous version of (14): even if this general solution were found, it would have ended up evaluating to zero for any anyway, entirely independently of arbitrary constant factor. The “uselessness” of the condition is easily overcome by the following observation: since
[TABLE]
because of (10), we immediately obtain from (16) that
[TABLE]
where the second equality is, again, due to (10). In other words, some of the ’s are given away “for free” by the recurrence (16) itself. Alternatively, from (11) and the definition of we have
[TABLE]
where the last equality is because of (Prudnikov et al., 1990, Identity 2.19.5.6, p. 217), i.e., the definite integral identity
[TABLE]
valid as long as , , and . Next, from the identity
[TABLE]
which is (Slater, 1960, Identity (2.4.22), p. 25) and (Slater, 1960, Identity (2.4.23), p. 25) combined, we obtain
[TABLE]
whence
[TABLE]
because
[TABLE]
as is implied by (Slater, 1960, Identity (2.4.24), p. 25), i.e., the identity
[TABLE]
and condition (9). Now, putting all of the above together (18) follows.
To be able use (17) and (18) to get in as simple a form as possible, it is necessary to first evaluate (17) for , and “massage” the result so as to bring it to a suitable form. One way to achieve this is through the identity
[TABLE]
which is one of the contiguous relations that the function is known to satisfy. Since from (8) it is easy to see that
[TABLE]
the above contiguous relation leads to
[TABLE]
so that subsequently from (23) we find
[TABLE]
whence, in view of (18), we finally obtain
[TABLE]
where again is determined by (9) and is defined in (10).
We have obtained not one but two expressions for , although the difference between the expressions is only in the sign of . The two expressions are equivalent, and, in a nutshell, it has to do with condition (9) and the symmetry of the Whittaker function with respect to its second index, i.e., . To give a more detailed explanation we turn to (Abramowitz and Stegun, 1964, Identity 13.1.34, p. 505), i.e., the identity
[TABLE]
where denotes the Whittaker function. On account of (9) this identity gives
[TABLE]
or equivalently
[TABLE]
because, again, for any . Next, since the Whittaker function and the Kummer function are related, and the relationship is given by (Buchholz, 1969, Identities (3a) and (4), p. 11), which together amount to
[TABLE]
we further obtain
[TABLE]
and it is now evident that the sign of on the right of (21) is, indeed, irrelevant.
It is possible to simplify (21) considerably, with the help of the identity
[TABLE]
which was established by (Polunchenko and Pepelyshev, 2018, p. 1373). In view of this identity and the earlier discussion one can see that
[TABLE]
whence
[TABLE]
i.e., is precisely the normalizing factor in formulae (11)–(12) for the quasi-stationary pdf and cdf . Other than being far more compact than (21), the new formula for also makes it clear that the latter is invariant with respect to the sign of : the reason is the symmetry of the Whittaker function with respect to its second index, i.e., . Moreover, since
[TABLE]
as was previously shown by (Polunchenko, 2016, p. 139), it can be readily concluded that .
We are now ready to state our main result.
Lemma 4.3**.**
For every fixed, the fractional moment of order of the quasi-stationary distribution (11)–(12) is given by
[TABLE]
where
[TABLE]
with determined by (9) and defined in (10); recall also that denotes the corresponding special case of the generalized hypergeometric function (8).
The obtained -th order fractional moment formula (23) is valid even if for , so long as it is used with care: the first term on the right of (23) experiences a singularity at , and one of the two Gamma functions in the second term on the right of (23) experiences a (simple) pole whenever . To better understand how to use formula (23) for , suppose, for simplicity, that , and consider for some sufficiently small . For this choice of from (23) and (24) and repeated use of , , we have
[TABLE]
so that each of the two terms inside the braces can be seen to have a singularity of type as . However, these singularities “undo” each other in such a way that
[TABLE]
ends up being well-defined. To show this and properly handle the singularities observe first that
[TABLE]
whence on account of
[TABLE]
and recalling (6) and (8) one can conclude that
[TABLE]
where denotes the digamma function (5); recall again that is determined by equation (9).
The case of , , can be treated in a similar manner, leading to
[TABLE]
where is as in (25).
5 Concluding remarks
It was pointed out in the introduction that the quasi-stationary distribution (2) of the Shiryaev process (1) converges (weakly) to the process’ stationary distribution (3), as ; see Pollak and Siegmund (1985, 1986) and Li and Polunchenko (2019). Since the stationary distribution—explicitly given by (4)—has fractional moments of orders only, it stands to reason that, unless , the -th fractional moment of the quasi-stationary distribution (11)–(12) is divergent as . This can be verified by computing the corresponding limit explicitly through the -th fractional moment formula (23). Specifically, recalling (13) whereby \lim_{A\to+\infty}\big{(}A\,\lambda_{A}^{1+\delta}\big{)}=0 for any but , so that owing to (10), we find
[TABLE]
where the second equality follows directly from the definition (8) of the generalized hypergeometric function. Hence, from the -th fractional moment formula (23) we obtain
[TABLE]
where we also used mentioned above. The obtained result is precisely the fractional moment of order of the stationary distribution (4), because of (Gradshteyn and Ryzhik, 2014, Identity 3.381.4, p. 348) which states that
[TABLE]
provided and .
As a final comment, we note that the -th fractional moment formula (23) can also be used to compute the -moment of the quasi-stationary distribution. Specifically, if is, again, a random variable having the quasi-stationary distribution given by (11), then
[TABLE]
where is as in (23) and is determined by (9).
Acknowledgements
The authors are thankful to the two anonymous referees whose constructive feedback helped improve the quality of the manuscript.
The effort of A.S. Polunchenko was supported, in part, by the Simons Foundation via a Collaboration Grant in Mathematics under Award # 304574.
The work of A. Pepelyshev was partially supported by the Russian Foundation for Basic Research under Project # 17-01-00161.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abramowitz and Stegun (1964) M. Abramowitz and I. Stegun, editors. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables , volume 55 of Applied Mathematics Series . United States National Bureau of Standards, tenth edition, 1964.
- 2Avram et al. (2013) F. Avram, N. N. Leonenko, and N. Šuvak. On spectral analysis of heavy-tailed Kolmogorov–Pearson diffusions. Markov Processes and Related Fields , 19(2):249–298, 2013.
- 3Bateman and Erdélyi (1953) H. Bateman and A. Erdélyi. Higher Transcendental Functions , volume 1. Mc Graw-Hill, New York, NY, 1953.
- 4Buchholz (1969) H. Buchholz. The Confluent Hypergeometric Function , volume 15 of Springer Tracts in Natural Philosophy . Springer-Verlag, New York, NY, 1969. Translated from German by H. Lichtblau and K. Wetzel.
- 5Burnaev et al. (2009) E. V. Burnaev, E. A. Feinberg, and A. N. Shiryaev. On asymptotic optimality of the second order in the minimax quickest detection problem of drift change for Brownian motion. Theory of Probability and Its Applications , 53(3):519–536, 2009. doi: 10.1137/S 0040585 X 97983791 .
- 6Collet et al. (2013) P. Collet, S. Martínez, and J. San Martín. Quasi-Stationary Distributions Markov Chains, Diffusions and Dynamical Systems . Probability and Its Applications. Springer, New York, NY, 2013.
- 7Comtet and Monthus (1996) A. Comtet and C. Monthus. Diffusion in a one-dimensional random medium and hyperbolic Brownian motion. Journal of Physics A: Mathematical and General , 29(7):1331–1345, 1996. doi: 10.1088/0305-4470/29/7/006 .
- 8Donati-Martin et al. (2001) C. Donati-Martin, R. Ghomrasni, and M. Yor. On certain Markov processes attached to exponential functionals of Brownian motion: Application to Asian options. Revista Matemática Iberoamericana , 17(1):179–193, 2001. doi: 10.4171/RMI/292 .
