Grenander functionals and Cauchy's formula
Piet Groeneboom

TL;DR
This paper extends the analysis of the Grenander estimator's properties using complex analysis techniques, correcting previous results and deriving new asymptotic distributions for integrals of the estimator.
Contribution
It introduces a general framework for analyzing Grenander functionals using Cauchy's formula, correcting earlier inaccuracies and expanding asymptotic results.
Findings
Extended the asymptotic distribution results for Grenander estimator integrals.
Corrected previous inaccuracies in the distribution of sums of gamma and Poisson variables.
Applied saddle point methods and Cauchy's formula to nonparametric estimator analysis.
Abstract
Let be the nonparametric maximum likelihood estimator of a decreasing density. Grenander characterized this as the left-continuous slope of the least concave majorant of the empirical distribution function. For a sample from the uniform distribution, the asymptotic distribution of the -distance of the Grenander estimator to the uniform density was derived in Groeneboom and Pyke (1983) by using a representation of the Grenander estimator in terms of conditioned Poisson and gamma random variables. This representation was also used in Groeneboom and Lopuhaa (1993) to prove a central limit result of Sparre Andersen on the number of jumps of the Grenander estimator. Here we extend this to the proof of a general result on integrals of the Grenander estimator. We also correct Groeneboom and Pyke (1983), where the limit distribution of the sums of gamma and Poisson variables on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Grenander functionals and Cauchy’s formula
Piet Groeneboomlabel=e1][email protected] label=u1 [[
url]http://dutiosc.twi.tudelft.nl/~pietg
Delft University of Technology, Building 28, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands.
Abstract
Let be the nonparametric maximum likelihood estimator of a decreasing density. Grenander characterized this in [6] as the left-continuous slope of the least concave majorant of the empirical distribution function. For a sample from the uniform distribution, the asymptotic distribution of the -distance of the Grenander estimator to the uniform density was derived in [12] by using a representation of the Grenander estimator in terms of conditioned Poisson and gamma random variables. This representation was also used in [8] to prove a central limit result of Sparre Andersen [19] on the number of jumps of the Grenander estimator. Here we extend this to the proof of the main result in [12] and also prove a similar asymptotic normality results for the entropy functional. In [12] the limit distribution of the sums of gamma and Poisson variables on which the conditioning was done did not have the right form, which is corrected here. Cauchy’s formula and saddle point methods are the main tools in our development.
62E20, ,
62G05,
Grenander estimator, integral statistics, saddle points, Cauchy’s formula,
keywords:
[class=AMS]
keywords:
\startlocaldefs\setattribute
journalname \endlocaldefs
t2This manuscript is dedicated to the memory of Ronald Pyke
1 Introduction
The Grenander estimator is the (nonparametric) maximum likelihood estimator (MLE) of a monotone decreasing density. It was introduced in [6], where it was proved that it is the left-continuous slope of the least concave majorant of the empirical distribution function. Some properties and limit results are discussed in [10] and also in [11] in the special issue on nonparametric inference under shape constraints of the journal Statistical Science. The Grenander estimator is shown in Figure 1 for a sample of size from the uniform distribution on . It can be improved by using boundary penalties (in fact, the estimator is inconsistent at the boundary points [math] and ), but this is not the concern of the present paper.
The Grenander estimator is a piecewise constant function with downward jumps at locations which correspond to the changes of slope (“kinks”) of the least concave majorant of the empirical distribution function. Although the Grenander estimator is defined as the left-continuous slope of the empirical distribution function, we can make the Grenander estimator right-continuous by taking the limits on the right at its points of jump. This does not change the probability mass of the induced (absolutely continuous) probability distribution, which is absolutely continuous w.r.t. Lebesgue measure.
The number of jumps of the Grenander estimator is of order if the sample is from a uniform distribution (see Section 2), if the sample comes from a strictly decreasing smooth density like the exponential density, then the number of jumps is of order , for some constant . The limit behavior of the Grenander estimator for these situations is rather different. For a sample from the uniform distribution, we have for :
[TABLE]
where is the slope of the least concave majorant of the standard Brownian Bridge on , see Remark 2.2, p. 543 of [7]. The density of is a function of the standard normal distribution function and the standard normal density , see (3.11) in [9].
In contrast with the result (1.1), we get for a sample from a decreasing density on at a point , where is differentiable and , the following result, due to Prakasa Rao in [16]:
[TABLE]
where , that is: is the (almost surely unique) location of the maximum of two-sided Brownian motion minus the parabola . For further details, see, e.g., [10] and [11].
Recently, integrated functionals of a monotone density were studied in [14]. Using the same notation as in [14] the following functionals were studied:
[TABLE]
where is a nonincreasing function on and satisfies some regularity conditions. In the case that the underlying distribution is uniform, the following central limit result is proved:
Theorem 1.1** (Theorem 3.2 in [14]).**
Let and let . Then:
[TABLE]
where denotes the standard normal distribution.
We prove an analogous result for analytic functions , defined on the positive open complex half plane. So our functions have a lot more smoothness, but are on the other hand defined on the open complex half plane, which makes the result applicable to functions that are not covered by the conditions in [14]. We assume that satisfies the following condition.
Condition 1.1**.**
The function is analytic on the complex half plane , and satisfies the following conditions.
- (i)
. 2. (ii)
For :
[TABLE]
and 3. (iii)
For :
[TABLE]
We now have the following result.
Theorem 1.2**.**
Let satisfy Condition 1.1 and let be the Grenander estimator. Then
[TABLE]
where denotes the standard normal distribution.
The result is a corollary to Theorem 4.1 in Section 4, see the remark at the end of Section 4.
Examples of the application of Theorem 1.2 are:
Example 1.1**.**
Let . Then and . The function is obviously analytic on the positive complex half plane. Condition 1.1 is fulfilled, so we get
[TABLE]
This is the main result of [12]. Since in [14] Theorem 3.2 is deduced from this result, we also get Theorem 3.2 in [14] back from Theorem 1.2. **
Example 1.2**.**
Let . Then and . The function is again analytic on the positive complex half plane. Condition 1.1 is fulfilled, and we get:
[TABLE]
For this example the conditions of Theorem 3.2 in [14] are not satisfied. The result follows from Theorem 1.2 and can be applied to the theory on a likelihood ratio test for monotonicity in [2]. **
To derive limit results for the uniform distribution, a special representation in terms of gamma and Poisson random variables was given in [12], with the purpose of proving a limit result for a two-sample rank statistic introduced in the dissertation of [1] and also for a test statistic in the combination of tests in [18]. We describe this representation now.
Let be a sample from the uniform distribution, and let be the locations of the jumps of the Grenander estimator for this sample, augmented with the points [math] and . Note that , , , , are the successive intervals of constancy of the Grenander estimator if we take the estimator to be left-continuous.
Furthermore, let , and be defined by:
[TABLE]
where is the empirical distribution function of the sample , and where is the number of jumps of the Grenander estimator.
Next, let be independent Poisson random variables with , and let, for each , be a collection of independent gamma random variables, independent of the , where is Gamma (the sum of independent standard exponentials). We define:
[TABLE]
and
[TABLE]
Note that there are induced spacings (intervals of constancy of ) of length , induced spacings (intervals of constancy of ), consisting of two consecutive original spacings between locations of jumps of the least concave majorant, etc., where can be zero.
We now have the following representation theorem:
Theorem 1.3** (Theorem 2.1 in [12]).**
[TABLE]
Remark 1.1**.**
For specificity, the random variables , and , are ordered in Theorem 2.1 in [12]. There is also a zero-step spacing introduced in [12], but this does not seem to be necessary. **
Using this representation, we can reduce the proofs of the limit behavior of global functionals of the Grenander estimator to a theorem for gamma and Poisson random variables, under the condition . For convenience in later proofs, we also use a further standardization of :
[TABLE]
where and are defined by (1.9). A conditional central limit theorem for functionals of the Grenander estimator can then be proved under the condition:
[TABLE]
The infinitely divisible limit distribution of the pair was given in Lemma 3.1 of [12], but unfortunately Lemma 3.1 of [12] contains a rather silly error (the in the representation of the characteristic function should be ). The correct version of this result is given in Lemma 3.1 below, where also the origin of the error is explained. The proof in [8] does not use the result on the limit distribution of , so is not influenced by the erroneous Lemma 3.1 in [12].
We use methods different from those in [12]. The conditional central limit theorem was proved in [12] using Le Cam’s paper [13] (a paper apparently published without his permission, as became clear in a conversation of the author with him). In the present context, where we clearly have to deal with non-standard asymptotics, [13] does not seem to be the right tool to use. We replace this by a direct analysis of the characteristic function. The crucial tools here are Cauchy’s formula and saddle point methods, using contour integration in the complex plane. To illustrate our method in a simple setting, we give a shortened version of the proof in [8] of Sparre Andersen’s result [19] in Section 2.
2 Sparre Andersen’s result
To illustrate our method in the simplest setting, we give a short version of the proof in [8] of the following remarkable result of Sparre Andersen in [19].
Theorem 2.1** (Sparre Andersen’s result).**
Let be a sample from the Uniform distribution and let be the number of jumps of the Grenander estimator for this sample. Then
[TABLE]
where is the standard normal distribution.
Proof.
Let, for the sample , be defined by:
[TABLE]
and let be defined as in (1.9). Using (part of) the representation, introduced in Section 1, we only have to prove that tends in law to a standard normal distribution. To this end we consider the conditional characteristic function
[TABLE]
Lemma 3.2 of [12] implies:
[TABLE]
Hence we get, by Fourier inversion and using the notation ,
[TABLE]
Denoting the contour by , we write this in the form
[TABLE]
where is given by:
[TABLE]
The expression in (2.1), multiplied by , is by Cauchy’s formula equal to the coefficient of in the power series around of the function:
[TABLE]
Comparing this with the power series of the function , we see that the coefficient of is the same in both series. This coefficient is:
[TABLE]
So we get:
[TABLE]
∎
3 The limit distribution of the conditioning variables
Let the pair be defined by (1.10). We prove the following lemma, which corrects Lemma 3.1 in [12].
Lemma 3.1**.**
The pair converges in distribution to , where has the infinitely divisible characteristic function
[TABLE]
Proof.
We have:
[TABLE]
where is the characteristic function of the centered gamma variable , see (3.9) of [12].
Writing , and noting that for :
[TABLE]
and that the limit of the expression on the left for is equal to:
[TABLE]
we can write the exponent in the form
[TABLE]
(it is here that the mistake was made in [12], in the formula after (3.9) on p. 333), so we get a Riemann sum converging to the integral
[TABLE]
The infinite divisibility of the limit distribution is shown below. ∎
Remark 3.1**.**
In [12] first the limit distribution of is computed, using moment conditions (going up to the th moments). Next the limit distribution of is computed and it is stated that this distribution is infinite divisible and has no normal component, implying that therefore the limit has to be independent of the limit of .
The in the exponent of the characteristic function of the limit distribution of in Lemma 3.1 of [12] should be . The incorrect arose on p. 333 of [12], where the limit of the characteristic function of the rescaled gamma random variable was given by instead of . This also invalidates the ensuing remarks on p. 333 of [12]. We correct these remarks below. **
The distribution of is infinitely divisible, as we now show. A general characterization of infinitely divisible distributions in is given in [17] and given below for convenience.
Theorem 3.1** (Theorem 1.3 in [17], Lévy-Khintchine representation).**
If the distribution is infinitely divisible, then its characteristic function is given by:
[TABLE]
where is a symmetric nonnegative-definite matrix, is the Euclidean norm, is a measure on satisfying , , and where . The representation (3.2) by and is unique. Conversely, for any choice of and satisfying the conditions above, there exists an infinite divisible distribution having characteristic function (3.2).
In the present situation we can take the matrix with zeroes, and define by the density
[TABLE]
where is the standard normal density. With these choices of , and we get, using the notation and ,
[TABLE]
We note that the computer package Mathematica evaluates the characteristic function for in the following way:
[TABLE]
where is Euler’s gamma and is the complementary incomplete gamma function, defined by:
[TABLE]
see, e.g., (2.01) on p. 109 of [15].
4 Central limit theorem for
In this section, we use the notation
[TABLE]
Using the conditioning of Section 1, the statistic has the following representation:
[TABLE]
where the Poisson random variables and the gamma random variables are defined as in (1.9), and where we condition on , where is defined by (1.10). We define
[TABLE]
Remark 4.1**.**
The terms are present in as variance reducing terms and give, after the summation over and , a zero contribution to if . Also note that
[TABLE]
if the condition is satisfied. **
We assume that the function satisfies condition 1.1 and first consider the conditional density of , given .
Lemma 4.1**.**
The conditional density of , given , is the density of a centered and standardized Gamma variable:
[TABLE]
The density converges uniformly to the standard normal density, as .
Proof.
By Fourier inversion, the conditional characteristic function is given by:
[TABLE]
where , see (3). Denoting the contour by , we write this in the form
[TABLE]
where is given by:
[TABLE]
and where we use:
[TABLE]
by Lemma 3.2 of [12]. An application of Cauchy’s formula yields:
[TABLE]
where we use that the coefficient of in the power series around of the function is the same as the coefficient of in the power series of the function .
Hence
[TABLE]
This is just the characteristic function of the sum of standardized exponential variables, and hence its density tends uniformly to the standard normal density by Theorem 2 on p. 516 of [5]. ∎
So, in particular, we get:
[TABLE]
where is Euler’s gamma. The characteristic function of is therefore given by
[TABLE]
We now consider the characteristic function , defined by:
[TABLE]
which involves the components of the random variables and . Its asymptotic behavior is determined using a saddle point method.
Lemma 4.2**.**
The characteristic function (4.5) satisfies
[TABLE]
uniformly for in a bounded interval.
Proof.
After a change of variables, can be written:
[TABLE]
where is defined on the right half plane by:
[TABLE]
The derivative of is given by
[TABLE]
A saddle point is given by the equation
[TABLE]
Multiplying both sides of this equation with , we get the equation
[TABLE]
This equation has, for sufficiently large , a unique solution in a neighborhood of , as is clear from the following properties.
- (i)
[TABLE] 2. (ii)
For we have:
[TABLE]
see, e.g., (3.3), p. 56 of [4]. So the saddle point is given by a solution of the equation (4.9) and can be found by the simple iteration , starting at . We can, however, also take itself instead of the real saddle point for the asymptotic expansion, since this gives us the same terms in the expansion we need.
The value of at has the following expansion:
[TABLE]
Furthermore,
[TABLE]
so
[TABLE]
It follows that we get:
[TABLE]
where is a complex number with absolute value 1, corresponding to the argument of the main axis of the saddle point (note that the argument of this axis is , see [3], p. 84).
Evaluating the integrand at , and applying Stirling’s formula on , we obtain the following asymptotic representation:
[TABLE]
see, e.g., [3], (5.10.3) on p. 92 for the first asymptotic equivalence. The second asymptotic equivalence holds uniformly for in a bounded interval. Note that this corresponds to changing the path of integration for to a path in the complex plane, going through the saddle point. ∎
We can now prove the following property of the characteristic function of . This is “almost” the Fourier inversion for , but we still have to extend the inversion for to the whole real line. Cauchy’s formula is an essential ingredient of the proof of Lemma 4.3.
Lemma 4.3**.**
Let be defined by (4.2) and let satisfy condition 1.1. Then, for each :
[TABLE]
where is Euler’s gamma.
Proof.
Let be defined by (4.5). Since, by (4.1) and (4.2),
[TABLE]
where
[TABLE]
we get, evaluating the probabilities for the Poisson random variables in the third line:
[TABLE]
As in the proof of Lemma 4.1 we consider the contour and denote this contour by . So, integrating (4) w.r.t. and changing variables we get:
[TABLE]
where
[TABLE]
and
[TABLE]
and where is a complex number of absolute value 1, corresponding to the angle of the main axis through the (approximate) saddle point .
So we get, by Cauchy’s formula, for and arbitrarily large,
[TABLE]
Thus:
[TABLE]
∎
We still have to prove that the remaining part of integral w.r.t. the integration variable can be made arbitrarily small by choosing large. To this end we split the remaining region into two regions: and . This split-up is familiar from inversion theorems for densities, see, e.g., the proof of Theorem 2 on p. 516 of [5]. We start with the region .
Lemma 4.4**.**
Let satisfy condition 1.1. Then there exists for each an and such that
[TABLE]
Proof.
We consider again the expansion of the function defined by (4.7) at the point . We get:
[TABLE]
where is a point on the line segment between and . Likewise
[TABLE]
where is a point on the line segment between and .
So we have a local expansion
[TABLE]
as in (4). This means that we can follow the same steps as in the proof of Lemma 4.3 and that we can choose and in such a way that
[TABLE]
∎
The following lemma deals with the region .
Lemma 4.5**.**
Let satisfy condition 1.1. Then, for each :
[TABLE]
Proof.
We consider the characteristic function:
[TABLE]
so we replace by in (4.5). This means that, for the saddle point analysis, the constant is replaced by in the function (4.7). So we now define
[TABLE]
The saddle point equation (4.9) now turns into
[TABLE]
and has a unique solution in a neighborhood of for the same reasons as before.
We define:
[TABLE]
Then:
[TABLE]
and
[TABLE]
uniformly in , using Condition 1.1. This implies that, uniformly for ,
[TABLE]
where is a complex number with absolute value 1 and argument .
So we find, using Cauchy’s formula again, as in the proof of Lemma 4.3, and using Condition 1.1,
[TABLE]
where
[TABLE]
and
[TABLE]
Hence
[TABLE]
implying
[TABLE]
for large , uniformly for , using Condition 1.1. It now follows that
[TABLE]
∎
This leads to the main result of this section.
Theorem 4.1**.**
Let satisfy Condition 1.1. Then converges in law to a standard normal distribution.
Proof.
The preceding lemma’s imply
[TABLE]
where is Euler’s gamma. Hence
[TABLE]
∎
Theorem 1.2 now follows from the conditional representation from Section 1, definition (4.2) of , Remark 4.1 and Theorem 4.1.
5 Conclusion
We derived a general theorem (Theorem 1.2) for integrals of the Grenander estimator when the distribution is uniform from a representation in terms of Poisson and gamma random variables in [12]. The result implies the main result of [12] and gives also the limit behavior of the entropy functional. We corrected the limit distribution of the conditioning variables given in Lemma 3.1 of [12] in Section 3. The methods used are rather different from the methods in [12], where a result in [13] was used.
Our main result was inspired by [14] who derived a similar result from [12] under different conditions. The main tools are Cauchy’s formula and the saddle point method for integrals of analytic functions of a complex variable. A simple version of the approach is given in Section 2 to illustrate the method without the complications of the saddle point method.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Behnen [1974] K. Behnen. Güteeigenschaften von Rangtests unter Bindungen . Habilitationsschrift, University of Freiburg, 1974.
- 2Chan et al. [2018] K. C. G. Chan, C-F Tang, and S.C.P Yam. Likelihood ratio test for monotonicity of density. Submitted, 2018.
- 3de Bruijn [1981] N. G. de Bruijn. Asymptotic methods in analysis . Dover Publications, Inc., New York, third edition, 1981. ISBN 0-486-64221-6.
- 4Dieudonné [1968] Jean Dieudonné. Calcul infinitésimal . Hermann, Paris, 1968.
- 5Feller [1971] William Feller. An introduction to probability theory and its applications. Vol. II . Second edition. John Wiley & Sons, Inc., New York-London-Sydney, 1971.
- 6Grenander [1956] Ulf Grenander. On the theory of mortality measurement. II. Skand. Aktuarietidskr. , 39:125–153 (1957), 1956.
- 7Groeneboom [1985] P. Groeneboom. Estimating a monotone density. In Proceedings of the Berkeley conference in honor of Jerzy Neyman and Jack Kiefer, Vol. II (Berkeley, Calif., 1983) , Wadsworth Statist./Probab. Ser., pages 539–555, Belmont, CA, 1985. Wadsworth.
- 8Groeneboom and Lopuhaä [1993] P. Groeneboom and H. P. Lopuhaä. Isotonic estimators of monotone densities and distribution functions: basic facts. Statist. Neerlandica , 47:175–183, 1993. ISSN 0039-0402. 10.1111/j.1467-9574.1993.tb 01415.x . URL http://dx.doi.org/10.1111/j.1467-9574.1993.tb 01415.x . · doi ↗
