An integral formula for the powered sum of the independent, identically and normally distributed random variables
Tamio Koyama

TL;DR
This paper derives an integral formula for the distribution of the sum of powered standard normal variables, generalizing chi-squared, which aids in evaluating their density functions.
Contribution
It introduces a new integral representation for the density of the powered sum of i.i.d. normal variables, based on the inversion formula and hyperfunction analysis.
Findings
Provides a convergent integral formula for the density function.
Facilitates evaluation of the distribution of powered sums of normal variables.
Connects the formula with hyperfunction theory.
Abstract
The distribution of the sum of r-th power of standard normal random variables is a generalization of the chi-squared distribution. In this paper, we represent the probability density function of the random variable by an one-dimensional absolutely convergent integral with the characteristic function. Our integral formula is expected to be applied for evaluation of the density function. Our integral formula is based on the inversion formula, and we utilize a summation method. We also discuss on our formula in the view point of hyperfunctions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsChaos-based Image/Signal Encryption · Bayesian Methods and Mixture Models · Probability and Risk Models
An integral formula for the powered sum of
the independent, identically and normally distributed random variables
Tamio Koyama
Abstract
The distribution of the sum of -th power of standard normal random variables is a generalization of the chi-squared distribution. In this paper, we represent the probability density function of the random variable by an one-dimensional absolutely convergent integral with the characteristic function. Our integral formula is expected to be applied for evaluation of the density function. Our integral formula is based on the inversion formula, and we utilize a summation method. We also discuss on our formula in the view point of hyperfunctions.
1 Introduction
Let be a sequence of the independent, identically distributed random variables, and suppose the distribution of each variable is the standard normal distribution. Fix a positive integer and . In this paper, we discuss the sum of the -th power of standard normal random variables. This distribution function can be written in the form of dimensional integral:
[TABLE]
In the case where , this integral equals to the cumulative distribution function of the chi-squared statistics. The sum of the -th power is an generalization of the chi-squared statistics. The weighted sum , where is a positive real number, of the squared normal variables is another generalization of the chi-squared statistics. Numerical analysis of the cumulative distribution function of the weighted sum is discussed in [1] and [7]. In [8], Marumo, Oaku, and Takemura discussed the numerical analysis of integral (1) in the case where . Their approach was the holonomic gradient method ([9] ,Chapter 6 of [3]). They derived a system of differential equations for the probability density function of . By numerically solving the system of differential equations, they evaluate the probability density function. As a generalization of the chi-squared statistics or the sum of cubes of standard normal random variables, the sum of the -th power () of standard normal random variables is a basic quantity in statistics. Detailed investigation on the quantity can be expected to be applied to Hypothesis test, and we are interested in numerical analysis of the probability density function of the quantity.
In order to derive the system of differential equations for the probability density function of . Marumo, Oaku, and Takemura utilize the following inversion formula:
[TABLE]
where is the characteristic function of . Note that Equation (2) is only a formal equation in a naive sence. Actuary, it is not certain whether the integral in the right-hand side of (2) is convergent or not. It is hard to prove the convergence of the integral by utilizing with the Laplace approximation. The discussion in [8] does not show the convergence of (2) and justify Equation (2) by the theory of Schwartz distributions. In this paper, we consider a justification of the formal equation (2) for general from the viewpoint of summation methods and hyperfunctions. As a result of the consideration, we derive a formula which represents the probability density function of as an one-dimensional absolutely convergent integral. This formula is expected to be applied for evaluation of the density function.
Before starting regorous discussion, let us give an intuitive discussion on the formal equation (2). Here, we consider the case where is the characteristic function of a general random varialbe . Note that the distribution of can be the exponential distribution, the binomial distribution or other distribution. Dividing the domain of the formal integral in the right-hand side of (2), it can be written as
[TABLE]
Note that the integral in the first term converges when holds, the second term converges when . Hence, the first and the second term define holomorphic functions on the upper half plane and the lower half plane respectively. Since these holomorphic functions has analytic continuation, the expression (3) make a sense for if is not a singular point of the holomorphic functions. This intuitive explanation can be justified by a summation methods in the case where random variable is the sum of -th power of standard normal random variables. For general random variables, this intuitive explanation is justified by theory of Fourier hyperfunctions. Theory of hyperfunction introduced in [11],[6] has many applications to linear partial differential equations (see, e.g., [2]). Applications of hyperfunctions to numerical analysis are discussed in [4] and [10]. Our discussion in this paper is an application of the theory of hyperfunctions to statistics.
The organization of this paper is as follows. In Section 2, we review the inversion formulae in the probability theory and the Fourier analysis, and we justify the intuitive discussion in Section 1 by a summation method. In Section 3, we review the theory of hyperfunctions of a single variable based on [11] and [5], and we justify the intuitive discussion in Section 1 from the view point of hyperfunctions. In Section 4, we apply the inversion formula given in Section 2 to the sum of -th power of standard normal random variables. We also calculate the limit in the boundary-value representation.
2 Inversion Formula
In this section, we review the inversion formulae in the probability theory and the Fourier analysis.
In the probability theory, the following Lévy’s inversion formula is most famous.
Theorem 1** (Lévy’s inversion formula).**
Let be a random variable and be the characteristic function of . Then, for ,
[TABLE]
holds. In the case where holds, has continuous probability density function and
[TABLE]
holds.
Proof.
see, e.g., [12, p175]. ∎
If has a probability density function , then the characteristic function of is the Fourier transformation of , i.e., . If is a rapidly decreasing function, the inversion formula (4) is holds immediately. If is a square-integrable function, , the inversion formula is
[TABLE]
Note that the right hand-side of (5) is improper integral. Even if is square-integrable, integrand may not be Lebesgue integrable, i.e., can be infinite. We also note that the probability density function may not be square-integrable. For example, the following probability density function is not square-integrable:
[TABLE]
If is a slowly increasing function, Equation (4) can be justified as an equation of Schwartz distributions.
The following proposition is a justification of the intuitive discussion in Section 1 by a summation method.
Proposition 1**.**
Let be a probability density function and be the characteristic function of . For any continuous point of , the equation
[TABLE]
holds.
We utilize a fact in [5, p28, Note 1.3] with a small change.
Lemma 1**.**
Let and be real-valued functions on . Suppose that is absolutely integrable and holds, and that is continuous ans bounded. For a positive number , put
[TABLE]
Then, for any , we have
Proof.
Since is decomposed as
[TABLE]
it is enough to show that the second term converges to zero. We decompose the integral domain into and . On the first domain, we have
[TABLE]
Note that is finite since is bounded. Since is absolutely integrable, converges to [math] as . Consequenty, we have
[TABLE]
On the second domain, we have
[TABLE]
Since is continuous, goes to zero as . Consequently, we have
[TABLE]
Therefore, we have ∎
Proof of Proposition 1.
Fix the continuous point of . Decompose into a sum of two continuous functions and where is support compact and equals to zero on a neighborhood of . Put . By the Fubini’s theorem, we have
[TABLE]
Apply Lemma 1 with , then we have
[TABLE]
Analogously, we have
[TABLE]
Since equals to zero on a neighborhood of , for sufficiently small , the above integral equals to
[TABLE]
By the Lebesgue’s dominated convergence theorem, this integral converges to [math] as . Hence, we have
[TABLE]
∎
3 Perspective of Hyperfunction
In this section, we briefly review the theory of hyperfunctions and justify the intuitive discussion in Section 1 from the view point of hyperfunctions.
In the first, we review the theory of hyperfunctions of a single variable based on [11] and [5]. Let be the sheaf of holomorphic functions on the complex plane . The sheaf of the hyperfunctions of a single variable is defined as the -th derived sheaf of . The global section is the inductive limit with respect to the family of complex neighborhoods . The elements of are called hyperfunctions on . Any hyperfunction on can be represented as a equivalent class with a representative .
Let be the spaces of locally integrable functions on . There is a natural embedding from to . When an integrable function satisfies some condition, we can take a representative of as a holomorphic function on .
Proposition 2** (M. Sato).**
Let be an integrable function on such that the integral is finite. Take a constant , and put
[TABLE]
Then we have .
Proof.
See, [11]. ∎
We call a measure on to be locally integrable when is finite for any compact set . We denote by the space of the locally integrable measures on . Analogous to the case of , there is a natural embedding from to , and the following proposition holds:
Proposition 3** (M. Sato).**
Let be a measure on such that the integral is finite. Take a constant , and put
[TABLE]
Then we have .
Let be a neighborhood of , and put . A function holomorphic on a tubular domain is said to be slowly increasing if for any compact subset and any , there exists such that
[TABLE]
uniformly for . A hyperfunction on is said to be slowly increasing if we can take a slowly increasing function as its representative, i.e., there exists a slowly increasing function such that . Put
[TABLE]
The following definition is a specialization into the case of a single variable of the general definition in [5].
Definition 1**.**
Let a slowly increasing function on a tubular domain be a representative of a slowly increasing hyperfunction . Take a positive number . We define the Fourier transform of as where
[TABLE]
Here, for and a function with a complex variable , we denote the integral by
In the second, we give and show an equation corresponding to the formal equation (2). Let be a random variable. The distribution of and the characteristic function of can be regarded as hyperfunctions. We denote them by and respectively. The following equation corresponds to the formal equation (2).
[TABLE]
Since the characteristic function can be regarded as the inverse Fourier transformation of the distribution under a suitable condition, the equation (6) is an inversion formula.
We calculate defining functions of and . Let be a constant. Since is a probability measure on , we have
[TABLE]
and
[TABLE]
By Proposition 3, we have
[TABLE]
Since the estimation holds, we can apply Proposition 2 to . In fact, we have
[TABLE]
Hence, we have
[TABLE]
We calculate the Fourier transformation of .
Lemma 2**.**
For a random variable , the hyperfunction corresponding to the characteristic function of is slowly increasing.
Proof.
For simplicity, we assume without loss of generality. The absolute value of the representative of satisfies the inequality
[TABLE]
Put , then the inequalities
[TABLE]
imply that the right hand side of (8) is bounded above by
[TABLE]
Hence, is a slowly increasing hyperfunction. ∎
In order to calculate an explicit form of a defining function of the Fourier transformation of , we decompose the function as
[TABLE]
where is the Heaviside function. Embedding the both sides of the above equation into the global section of the sheaf of hyperfunctions, we have
[TABLE]
By Proposition 2, defining functions of and are given by
[TABLE]
respectively. Then we have Analogous to Lemma 2, we can show that both and are slowly increasing hyperfunctions.
For a function with a complex variable , we denote by the integral along the path described in Figure 2. We also denote by the integral along the path described in Figure 2.
We denote by the upper half-plane .
Lemma 3**.**
The holomorphic functions
[TABLE]
are defining functions of the Fourier transformations of and respectively, i.e., we have .
Proof.
By Definition 1, a defining function of is given by
[TABLE]
Since is holomorphic on , we have
[TABLE]
Note that the integral defines a holomorphic function on a neighborhood of . Hence,
[TABLE]
is a defining function of also.
We can show analogously. ∎
Lemma 4**.**
Put
[TABLE]
Then is a defining function of .
Proof.
It is enough to calculate the right hand side of the equation .
For , we have
[TABLE]
By the assumption , we have
[TABLE]
By the Fubini-Tonelli theorem and Cauchy’s integral formula, equals to
[TABLE]
For , we can show analogously. ∎
Lemma 5**.**
For , the following equation holds:
[TABLE]
Proof.
For , we have
[TABLE]
by the definition of the characteristic function. Since is negative, we have
[TABLE]
By the Fubini-Tonelli theorem, the right hand side of (10) equals to
[TABLE]
For , we can show the equation (9) analogously. ∎
Theorem 2**.**
For a random variable , equation (6) holds.
Proof.
By Lemma 5 and equation (7), we have
[TABLE]
∎
Other Proof of Proposition 1.
Let be a random variable whose probability density function is . Then, equals to the characteristic function of . By Lemma 2 and Lemma 4, the hyperfunction corresponding to is a Fourier hyperfunction, and a defining function of is
[TABLE]
By Theorem 2, the corresponding hyperfunction of equals to . For any continuous point of , the boundary-value representation [5, Theorem 1.3.12] implies
[TABLE]
∎
4 Sum of -th Power of the Normal Variables
Fix a positive integer . Let be the independent, identically and normally distributed random variables, and put . Let be the characteristic function of , i.e., put
[TABLE]
Then, the characteristic function of the powered sum equals to . By Proposition 1, we have
[TABLE]
for any continuous point of .
In order to calculate the limit in the right hand side of (12), we discuss on the analytic continuations of the each term of (3).
In the first, we consider the characteristic function in (11). The characteristic function can be decomposed as
[TABLE]
We are interested in the analytic continuation of each term in the right hand side. In order to calculate them, we consider the following general form of integral:
[TABLE]
where the integral path with parameter is defined by
[TABLE]
Note that we have for .
For , put
[TABLE]
Examining the convergence region of the integral (13) for given parameter , we obtain the following lemma:
Lemma 6**.**
Suppose that . For given parameter , the integral (13) converges on .
Proof.
Take any point . By the straight forward calculation, we have
[TABLE]
Since assumption implies , there exist and such that
[TABLE]
holds for any . Hence, we have
[TABLE]
Consequently, the integral (13) converges on . ∎
Lemma 7**.**
Suppose that . If holds, then the integral (13) converges on . Here, denotes the closure of .
Proof.
For any point , we have , This implies
[TABLE]
∎
Example 1**.**
Let . Figures 4 and 4 show the contour of the integral path and the region for respectively. Figures 6 and 6 show them for .
Lemma 8**.**
If , then holds for .
Proof.
Put
[TABLE]
By the Cauchy’s integral formula, we have
[TABLE]
Here, we put By some calculations, we have that there exist and such that
[TABLE]
Taking the limit of (15) as approaches inifinity, we have ∎
In the second, we disscuss on the analytic continuation of the each term of (3). Let us consider the following integral:
[TABLE]
where , , and the integral path is defined by (14).
Lemma 9**.**
Suppose and . Then the integral (16) converges for .
Proof.
By the binomial expansion, the integral (16) equals to
[TABLE]
With some calculation, the each term of the above expression can be written as
[TABLE]
where we put , ,
[TABLE]
By the Fubini-Tonelli theorem, it is enough to consider the convergence of the following integral:
[TABLE]
For , we have . This inequality and the assumption and imply . Hence, the integral (17) equals to
[TABLE]
By the assumptions, the integral (18) is bounded from above by
[TABLE]
Therefore, the integral (16) converges. ∎
Lemma 10**.**
Suppose and for . The equation
[TABLE]
holds for .
Proof.
Put
[TABLE]
By the Cauchy’s integral formula, we have
[TABLE]
where we put By some calculations, we have
[TABLE]
for sufficiently large . Taking the limit of (19) as approaches inifinity, we have ∎
Theorem 3**.**
Let be a sufficiently small positive number and be a continuous point of . When is odd, we have
[TABLE]
When is even, we have
[TABLE]
Proof.
When is odd and , we have
[TABLE]
When is odd and , we have
[TABLE]
When is even and , we have
[TABLE]
When is odd and , we have
[TABLE]
∎
Remark 1**.**
The all of integral representations in Theorem 3 make sense as Lebesgue integral. As we have shown in Lemma 9, when we write the integral in Theorem 3 into an integral on , the integrand is a Lebesgue integrable function.
Acknowledgements: We are grateful to K. Yajima, T. Oshima, and S.-J. Matsubara-Heo for valuable comments to Proposition 1. This work was supported by MEXT/JSPS KAKENHI Grand Numbers JP 25220001, 18J01507.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Castaño-Martínez and F. López-Blázquez. Distribution of a sum of weighted noncentral chi-square variables. TEST , 14(2):397–415, December 2005.
- 2[2] U. Graf. Introduction to Hyperfunctions and Their Integral Transforms . Birkhäuser, Basel, 2010.
- 3[3] T. Hibi. Gröbner Bases: Statistics and Software System . Springer Japan, Tokyo, 2014.
- 4[4] I. Imai. Applied hyperfunction theory . Kluwer Academic Publishers, Dordrecht, 1992.
- 5[5] A. Kaneko. Introduction to Hyperfunctions . KTK Scientific Publishers, Tokyo, 1988.
- 6[6] T. Kawai. On the theory of fourier hyperfunctions and its applications to partial differential equations with constant coefficients. Journal of the Faculty of Science, the University of Tokyo , 17(3):467–517, December 1970.
- 7[7] T. Koyama and A. Takemura. Holonomic gradient method for distribution function of a weighted sum of noncentral chi-square random variables. Computational Statistics , 31(4):1645–1659, December 2016.
- 8[8] N. Marumo, T. Oaku, and A. Takemura. Properties of powers of functions satisfying second-order linear differential equations with applications to statistics. Japan Journal of Industrial and Applied Mathematics , 32:553–572, July 2015.
