A Simple Point Estimator of the Power of Moments
Shuhua Chang, Deli Li, Yongcheng Qi, Andrew Rosalsky

TL;DR
This paper introduces a simple estimator for the power of moments of a distribution, analyzes its asymptotic properties, and provides methods for hypothesis testing and practical examples.
Contribution
It proposes a novel, straightforward estimator for the power of moments and investigates its consistency and asymptotic behavior under broad conditions.
Findings
The estimator converges in probability to the true power of moments.
Consistency holds under mild tail conditions on the distribution.
Provides a formula for p-value calculation in hypothesis testing.
Abstract
Let be an observable random variable with unknown distribution function , and let \[\ \theta = \sup\left \{ r \geq 0:~ \mathbb{E}|X|^{r} < \infty \right \}. \] We call the power of moments of the random variable . Let be a random sample of size drawn from . In this paper we propose the following simple point estimator of and investigate its asymptotic properties: \[ \hat{\theta}_{n} = \frac{\log n}{\log \max_{1 \leq k \leq n} |X_{k}|}, \] where . In particular, we show that \[ \hat{\theta}_{n} \rightarrow_{\mathbb{P}} \theta~~\mbox{if and only if}~~ \lim_{x \rightarrow \infty} x^{r} \mathbb{P}(|X| > x) = \infty ~~\forall~r > \theta. \] This means that, under very reasonable conditions on , is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Risk Models · Statistical Distribution Estimation and Applications · Bayesian Methods and Mixture Models
A Simple Point Estimator of the Power of Moments
Shuhua Chang***Shuhua Chang, Research Center for Mathematics and Economics, Tianjin University of Finance and Economics, Tianjin 300222, China e-mail: [email protected], Deli Li†††Deli Li, Department of Mathematical Sciences, Lakehead University, Thunder Bay, ON P7B 5E1, Canada e-mail: [email protected], Yongcheng Qi‡‡‡Yongcheng Qi, Department of Mathematics and Statistics, University of Minnesota Duluth, Duluth, MN 55812, USA e-mail: [email protected], and Andrew Rosalsky§§§Andrew Rosalsky, Department of Statistics, University of Florida, Gainesville, Florida 32611, USA e-mail: [email protected]¶¶¶Corresponding author.
Abstract
Let be an observable random variable with unknown distribution function , and let
[TABLE]
We call the power of moments of the random variable . Let be a random sample of size drawn from . In this paper we propose the following simple point estimator of and investigate its asymptotic properties:
[TABLE]
where . In particular, we show that
[TABLE]
This means that, under very reasonable conditions on , is actually a consistent estimator of . Hypothesis testing for the power of moments is conducted and, as an application of our main results, the formula for finding the p-value of the test is given. In addition, a theoretical application of our main results is provided together with three illustrative examples.
MSC (2010): 62F10, 60F15, 62F12.
Keywords Asymptotic theorems Consistent estimator Maxima sequence Point estimator Power of moments
1 Motivation
The motivation of the current work arises from the following problem concerning parameter estimation. Let be an observable random variable with unknown distribution function , and let
[TABLE]
We call the power of moments of the random variable . Clearly is a parameter of the distribution of the random variable . Now let be a random sample of size drawn from random variable ; i.e., are independent and identically distributed (i.i.d.) random variables whose common distribution function is . It is natural to pose the following question: Can we estimate the the parameter based on the random sample , …, ?
This is a serious and important problem. For example, if and if the distribution of is nondegerate, then it is clear that and so by the classical Lévy central limit theorem, the distribution of
[TABLE]
is approximately normal (for all sufficiently large ) with mean [math] and variance where . Thus the problem that we are facing is how can we conclude with a high degree of confidence that .
In this paper we propose the following point estimator of and will investigate its asymptotic properties:
[TABLE]
Here and below .
Our main results will be stated in Section 2 and they all pertain to a sequence of i.i.d. random variables drawn from the distribution function of the random variable . The procedure of our study is as follows.
Step 1. For deducing the asymptotic properties of , , we will first precisely determine the values of such that
[TABLE]
where and ; see Theorems 2.1 and 2.2.
Step 2. Following from Theorems 2.1 and 2.2, in Theorem 2.3 we will provide different necessary and sufficient conditions for
[TABLE]
Step 3. Under the assumption that (1.1) holds for some , in Theorem 2.4 we will establish large deviation probabilities for
[TABLE]
Step 4. Under some reasonable conditions on , in Theorem 2.5 we will obtain a result on convergence in distribution for , .
Step 5. Replacing by and following from Theorems 2.1-2.5, in Theorem 2.6 we will state a set of asymptotic properties of , . In particular, one of them asserts that
[TABLE]
if and only if
[TABLE]
where “” stands for convergence in probability. If (1.2) holds for some , we will see from Theorem 2.6 that
[TABLE]
This means that, under very reasonable conditions on , is not only a consistent estimator of but also possesses a very good convergence rate.
The proofs of our main results will be provided in Section 3. As one can see from Section 3, the proofs of the main results are simple since only some basic results (such as the Borel-Cantelli lemma) in probability theory are used. We refer the reader to Chow and Teicher (1997) for any basic results in probability theory that are used in this paper.
In Section 4 hypothesis testing for the power of moments is conducted and, as an application of our main results, the formula for finding the p-value of the test is given. In addition, a theoretical application of our main results will be provided in Section 5 together with three illustrative examples.
2 Statement of the main results
Throughout, is a random variable with unknown distribution , and write
[TABLE]
Clearly, just as as defined in Section 1 is a parameter of the distribution of the random variable , so are and . These parameters satisfy
[TABLE]
The main results of this paper are the following Theorems 2.1-2.6.
Theorem 2.1**.**
Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable . Then
[TABLE]
and there exists an increasing positive integer sequence (which depends on the probability distribution of when ) such that
[TABLE]
Theorem 2.2**.**
Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable . Then
[TABLE]
and there exists an increasing positive integer sequence (which depends on the probability distribution of when ) such that
[TABLE]
Remark 2.1**.**
We must point out that (2.2) and (2.4) are two interesting conclusions. To see this, let be a sequence of independent random variables with
[TABLE]
Since
[TABLE]
it follows from the Borel-Cantelli lemma that
[TABLE]
However, for any sequences and of increasing positive integers,
[TABLE]
Remark 2.2**.**
For an observable random variable , it is often the case that . However, for any given constants and with , one can construct a random variable such that
[TABLE]
For example, if , a random variable can be constructed having probability distribution given by
[TABLE]
where , and
[TABLE]
Combining Theorems 2.1 and 2.2, we establish a law of large numbers for as follows.
Theorem 2.3**.**
Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable and let . Then the following four statements are equivalent:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
If , then anyone of (2.5)-(2.8) holds if and only if there exists a function such that
[TABLE]
The following result provides large deviation probabilities for , .
Theorem 2.4**.**
Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable . If (2.6) holds for some , then
[TABLE]
and
[TABLE]
Remark 2.3**.**
If (2.6) holds for some , it then follows from (2.10) and (2.11) that
[TABLE]
and
[TABLE]
and hence
[TABLE]
The following result concerns convergence in distribution for .
Theorem 2.5**.**
Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable . Suppose that there exist constants and and a monotone function with such that
[TABLE]
Then
[TABLE]
We now return to the problem posed in Section 1. Note that, for
[TABLE]
and
[TABLE]
We thus have that
[TABLE]
Thus, by Theorems 2.1-2.5, some asymptotic properties of the point estimator are provided in the following theorem.
Theorem 2.6**.**
Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable . Let
[TABLE]
Then we have:
(i)**
[TABLE]
[TABLE]
and the following three statements are equivalent:
[TABLE]
[TABLE]
[TABLE]
If , then anyone of (2.15)-(2.17) holds if and only if there exists a function such that
[TABLE]
(ii)* If (2.16) holds for some , then*
[TABLE]
and
[TABLE]
and hence
[TABLE]
(iii)* Suppose that there exist constants and and a monotone function with such that*
[TABLE]
Then
[TABLE]
Remark 2.4**.**
From Theorem 2.6, one can see that the point estimator posseses some nice asymptotic properties. In particular, it follows from (2.18) that
[TABLE]
Thus, under very reasonable conditions on , is a good candidate to be used for estimating since it is not only a consistent estimator of but also possesses a very good convergence rate.
3 Proofs of the main results
Let be a sequence of events. As usual the abbreviation stands for the event that the events occur infinitely often. That is,
[TABLE]
For events and , we say a.s. if where . To prove Theorem 2.1, we use the following preliminary result which can be found in Chandra (2012, Example 1.6.25 (a), p. 48).
Lemma 3.1**.**
Let be a nondecreasing sequence of positive real numbers such that
[TABLE]
and let be a sequence of random variables. Then
[TABLE]
Proof of Theorem 2.1 Case I: . For given , let . Then
[TABLE]
and hence
[TABLE]
By the Borel-Cantelli lemma, (3.1) implies that
[TABLE]
By Lemma 3.1, we have
[TABLE]
and hence
[TABLE]
Thus
[TABLE]
Letting , we get
[TABLE]
By the definition of , we have that
[TABLE]
which is equivalent to
[TABLE]
Then, inductively, we can choose positive integers such that
[TABLE]
Note that, for any , . Thus, for all sufficiently large , we have that
[TABLE]
Since , by the Borel-Cantelli lemma, we get that
[TABLE]
which ensures that
[TABLE]
Clearly, (2.1) and (2.2) follow from (3.2) and (3.3).
Case II: . Using the same argument used in the first half of the proof for Case I, we get that
[TABLE]
and hence
[TABLE]
Note that
[TABLE]
We thus have that
[TABLE]
It thus follows from (3.4) and (3.5) that
[TABLE]
proving (2.1) and (2.2) (with , ).
Case III: . By the definition of , we have that
[TABLE]
which is equivalent to
[TABLE]
Then, inductively, we can choose positive integers such that
[TABLE]
Thus, for all sufficiently large , we have by the same argument as in Case I that
[TABLE]
and hence by the Borel-Cantelli lemma
[TABLE]
which ensures that
[TABLE]
Thus (2.1) and (2.2) hold. This completes the proof of Theorem 2.1.
Proof of Theorem 2.2 Case I: . For given , let and . Then and . By the definition of , we have that
[TABLE]
and hence for all sufficiently large ,
[TABLE]
Thus, for all sufficiently large ,
[TABLE]
and hence
[TABLE]
Since
[TABLE]
by the Borel-Cantelli lemma, we have that
[TABLE]
which implies that
[TABLE]
Letting , we get
[TABLE]
Again, by the definition of , we have that
[TABLE]
which is equivalent to
[TABLE]
Then, inductively, we can choose positive integers such that
[TABLE]
Then we have that
[TABLE]
Thus, by the Borel-Cantelli lemma, we get that
[TABLE]
which ensures that
[TABLE]
Clearly, (2.3) and (2.4) follow from (3.6) and (3.7).
Case II: . By the definition of , we have that
[TABLE]
which is equivalent to
[TABLE]
Then, inductively, we can choose positive integers such that
[TABLE]
Thus
[TABLE]
and hence the Borel-Cantelli lemma
[TABLE]
which ensures that
[TABLE]
It is clear that
[TABLE]
It thus follows from (3.8) and (3.9) that
[TABLE]
i.e., (2.3) and (2.4) hold.
Case III: . Using the same argument used in the first half of the proof for Case I, we get that
[TABLE]
Letting , we get that
[TABLE]
Thus
[TABLE]
and hence (2.3) and (2.4) hold with , .
Proof of Theorem 2.3 It follows from Theorems 2.1 and 2.2 that
[TABLE]
Since (2.6) follows from (2.5), we only need to show that (2.6) implies (2.8). It follows from (2.6) that
[TABLE]
Since, for
[TABLE]
and
[TABLE]
it follows from (3.10) that
[TABLE]
which is equivalent to (2.8).
For , note that
[TABLE]
We thus see that, if , then (2.8) is equivalent to
[TABLE]
(We leave it to the reader to work out the details of the proof.) We thus see that (2.8) implies (2.9) with , . It is easy to verify that (2.8) follows from (2.9). This completes the proof of Theorem 2.3.
Proof of Theorem 2.4 Since (2.6) holds for some , it follows from the proof of Theorem 2.3 that the function , satisfies
[TABLE]
Thus, for fixed and , we have that, as ,
[TABLE]
and
[TABLE]
We thus have that
[TABLE]
and
[TABLE]
i.e., (2.10) and (2.11) hold.
Proof of Theorem 2.5 For fixed , write
[TABLE]
Then
[TABLE]
Since is a monotone function with , is a slowly varying function such that and hence
[TABLE]
Clearly,
[TABLE]
It thus follows from (2.13) that, as ,
[TABLE]
so that
[TABLE]
i.e., (2.14) holds.
Proof of Theorem 2.6 Since , , Theorem 2.6 (i) follows immediately from Theorems 2.1-2.3.
Since
[TABLE]
[TABLE]
Theorem 2.6 (ii) follows from Theorem 2.4.
Under the conditions of Theorem 2.6 (iii), by Theorem 2.5 we have that
[TABLE]
and hence
[TABLE]
Since is a monotone function with , is a slowly varying function and hence
[TABLE]
i.e.,
[TABLE]
Thus
[TABLE]
Thus, for fixed , we have that
[TABLE]
It now follows from (3.11) and (3.12) that
[TABLE]
and hence
[TABLE]
This proves Theorem 2.6 (iii).
4 Hypothesis testing for the power of moments
We now return to the statistical problem addressed in Section 1. Let be a random sample of size drawn from an observable random variable with unknown distribution function . Let be the power of moments of the random variable . Since, under very reasonable conditions on , is not only a consistent estimator of but also possesses a very good convergence rate, we use to estimate . Let be a specific value. In order to determine that is greater than , we conduct the following test of hypothesis for :
[TABLE]
and use to test (4.1).
Let be the observed value of based on an obtained data set. Then, for testing (4.1), under very reasonable conditions on , it follows from Theorem 2.6 (ii) that
[TABLE]
Let be a given level of significance. If the calculated p-value is greater than , we then fail to reject the null hypothesis at the level of significance. Otherwise, there is sufficient evidence (at the level of significance) to conclude that the alternative hypothesis is true.
Although the formula (4.2) can be used to calculate the p-value approximately for testing (4.1), it does not provide for us such a formula for the case . The following example shows us how the p-value can be found for the case .
Example 4.1**.**
Let be a random sample of size drawn from a population random variable such that
[TABLE]
where , , and are constants. For the case , we have that, for all sufficiently large
[TABLE]
5 A theoretical application of the main results
Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable . Then is called the maxima sequence associated with . Thus we see that our main results are actually stability theorems for the maxima sequence. The stability properties for the maxima sequence, which are useful in many practical situations where we are interested in extreme behaviour rather than average behaviour, have been studied by Gnedenko (1943), Barndorff-Nielsen (1963), Tomkins (1986), and many other authors.
The following classical and well-known stability theorem is due to Barndorff-Nielsen (1963).
Barndorff-Nielsen Stability Theorem Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable with
[TABLE]
Then there exists a sequence of real numbers such that
[TABLE]
if and only if
[TABLE]
In either case, the sequence may be assumed to be
[TABLE]
Since it usually can be very complicated to check whether the integral in (5.2) is convergent or divergent and to find , it is natural for us to seek an easy approach to see whether (5.1) holds and if so, to find easily and quickly. As an application of our Theorem 2.3, in this section we will provide such a powerful method; see Theorem 5.2 below.
First, our main results will be used to establish the following stability theorem for the maxima sequence.
Theorem 5.1**.**
Let and let . Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable . Write
[TABLE]
Then we have
[TABLE]
and the following four statements are equivalent:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Proof Since
[TABLE]
[TABLE]
we have that
[TABLE]
and
[TABLE]
Note that
[TABLE]
and, for
[TABLE]
We thus see that
[TABLE]
and
[TABLE]
and hence by Theorem 2.3
[TABLE]
and
[TABLE]
(i.e., (5.3) holds) and the statements (5.4), (5.5), (5.6), and (5.7) are equivalent.
Remark 5.1**.**
For and , note that
[TABLE]
We thus see that, if and , then (5.7) is equivalent to
[TABLE]
Thus
[TABLE]
if and only if there exists a function defined on such that
[TABLE]
The following result is more general than that in Remark 5.1 provided .
Theorem 5.2**.**
Let be a sequence of i.i.d. random variables drawn from the distribution function of the random variable such that
[TABLE]
for some increasing and continuous function and some function . If
[TABLE]
then
[TABLE]
Proof Since is an increasing and continuous function,
[TABLE]
It thus follows from (5.8) that
[TABLE]
and hence, by Theorem 5.1 with and ,
[TABLE]
i.e., almost surely
[TABLE]
Thus (5.9) implies that almost surely
[TABLE]
i.e.,
[TABLE]
Since
[TABLE]
we see that (5.10) follows from (5.11).
Theorem 5.2 can be used to determine the asymptotic behavior very quickly for the maxima sequence . This will be illustrated by the following three simple examples.
Example 5.1**.**
Let be a sequence of i.i.d. random variables drawn from a standard normal random variable . It is well known that
[TABLE]
where and , . Clearly,
[TABLE]
and condition (5.9) is fulfilled. Thus, by Theorem 5.2, we have that
[TABLE]
Remark 5.2**.**
Let be a sequence of i.i.d. random variables drawn from a standard normal random variable . We must point out that the stability properties for the maxima sequence have been well studied by many authors. For example, Gnedenko (1943) proved that
[TABLE]
which also yields (5.12) via Theorem 5.1. Goodman (1988) established a strong version of (5.13) in a Banach space setting. In particular, it follows from Theorem 2.1 of Goodman (1988) that
[TABLE]
and
[TABLE]
and hence
[TABLE]
Example 5.2**.**
Let be a sequence of i.i.d. random variables drawn from a Poisson random variable with parameter ; i.e.,
[TABLE]
Note that
[TABLE]
and, by Stirling’s formula,
[TABLE]
Thus, as
[TABLE]
where
[TABLE]
and
[TABLE]
Note that
[TABLE]
Thus it is easy to verify that all conditions of Theorem 5.2 are fulfilled with and . Hence, by Theorem 4.2, we have that
[TABLE]
Example 5.3**.**
Let be a sequence of i.i.d. random variables drawn from a random variable with
[TABLE]
where and are parameters. Let
[TABLE]
Then
[TABLE]
and all conditions of Theorem 5.2 are fulfilled with and . Thus, by Theorem 5.2, we have that
[TABLE]
Acknowledgments
The research of Deli Li was partially supported by a grant from the Natural Sciences and Engineering Research Council of Canada (Grant #: RGPIN-2014-05428) and the research of Shuhua Zhang was partially supported by the National Natural Science Foundation of China (Grant #: 91430108 and 11171251).
References
Barndorff-Nielsen, O.: On the limit behaviour of extreme order statistics. Ann. Math. Statist. 34, 992-1002 (1963) 2. 2.
Chandra, T.K.: The Borel-Cantelli Lemma. Springer, Heidelberg (2012) 3. 3.
Chow, Y. S., Teicher, H.: Probability Theory: Independence, Interchangeability, Martingales., Third Ed. Springer-Verlag, New York (1997) 4. 4.
Gnedenko, B.V.: Sur la distribution limite du terme maximum d’une série aléatoire. Ann. of Math. (2) 44, 423-453 (1943) 5. 5.
Goodman, V.: Characteristics of normal samples. Ann. Probab. 16, 1281-1290 (1988) 6. 6.
Tomkins, R.J.: Regular variation and the stability of maxima. Ann. Probab. 14, 984-995 (1986)
