Estimators of the correlation coefficient in the bivariate exponential distribution
W. J. Szajnowski

TL;DR
This paper derives a lower bound on the estimation error of the correlation coefficient in bivariate exponential distributions and evaluates the efficiency of three nonlinear estimators, highlighting their performance across different correlation ranges.
Contribution
It introduces a finite-support parameter constraint to establish a lower bound and compares the optimality of three nonlinear estimators for the correlation coefficient.
Findings
The cosine similarity-based estimator is highly efficient for correlation > 0.35.
The transformed Pearson correlation performs better for smaller correlation values.
A lower bound on estimation error is derived under the finite-support constraint.
Abstract
A finite-support constraint on the parameter space is used to derive a lower bound on the error of an estimator of the correlation coefficient in the bivariate exponential distribution. The bound is then exploited to examine optimality of three estimators, each being a nonlinear function of moments of exponential or Rayleigh observables. The estimator based on a measure of cosine similarity is shown to be highly efficient for values of the correlation coefficient greater than 0.35; for smaller values, however, it is the transformed Pearson correlation coefficient that exhibits errors closer to the derived bound.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Distribution Estimation and Applications · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference
Estimators of the correlation coefficient
in the bivariate exponential distribution
W. J. Szajnowski W. J. Szajnowski, is with Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, U.K., e-mail: [email protected] work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. In such a case, however, a modified version of this work will be accessible.
Abstract
A finite-support constraint on the parameter space is used to derive a lower bound on the error of an estimator of the correlation coefficient in the bivariate exponential distribution. The bound is then exploited to examine optimality of three estimators, each being a nonlinear function of moments of exponential or Rayleigh observables. The estimator based on a measure of cosine similarity is shown to be highly efficient for values of the correlation coefficient greater than 0.35; for smaller values, however, it is the transformed Pearson correlation coefficient that exhibits errors closer to the derived bound.
Index Terms:
Deterministic parameter estimation, envelope correlation coefficient, estimation error lower bounds
I Introduction
The bivariate exponential and Rayleigh probability distributions, [1, pp. 401–475], [2], play a prominent role in the development of models of dependent nondeterministic phenomena in science and engineering. Such models include power of a random signal received at multiple sensors exploiting time/space/frequency diversity, weather radar returns observed at co-polar and cross-polar channels, weights of edges in random graphs being matched, time intervals between significant events occuring in parts of a complex biological or man-made system and many more.
The statistical association between observables of interest can be characterized by exploiting various measures of dependence, such as mutual information, copulas and parametric or nonparametric correlation coefficients [1, pp. 105–177], [3]. In practice, the correlation coefficient appears to be a preferred choice owing to its computational simplicity, and also the fact that it can be functionally related to copulas and mutual information [3], [4].
The problem of estimating the correlation coefficient between non-negative observables has been discussed in a number of publications [5]–[8]. However, since the finite-support constraint on the parameter space has been ignored, no conclusions regarding optimality of proposed estimators could be drawn. Therefore, it is of interest to establish a constrained lower bound on the estimator error and examine estimators that could attain this bound.
II Rayleigh and Exponential Distributions
Consider two complex Gaussian random variables (rvs), and , where . The four jointly Gaussian components, , have all zero means, , where denotes expectation, and their covariance matrix is of the form [9]
[TABLE]
where and are correlation coefficients between respective rvs.
In signal processing, the complex Gaussian rvs, and , may be viewed as discrete-time samples of two dependent complex Gaussian processes and . The rvs, and , may also represent samples, taken at different time instants, say, and , of a single stationary complex Gaussian process ; in such a case, and .
II-A Bivariate Rayleigh Distribution
Pairs of rvs, and , can be used to construct two Rayleigh rvs, and , as follows
[TABLE]
The rvs and represent magnitudes of the corresponding underlying complex Gaussian rvs and .
The joint probability density function (pdf) of and is given by [2]
[TABLE]
where , and denotes a modified Bessel function of the first kind of order zero. If , then , where and are marginal Rayleigh pdfs of and , respectively. Therefore, in this case, zero correlation implies statistical independence.
Population joint moments, , of rvs and are given by [9]
[TABLE]
where and are complete elliptic integrals of the first and second kind. In particular,
[TABLE]
and when approaches one, tends to infinity.
II-B Bivariate Exponential Distribution
The transformation
[TABLE]
converts two Rayleigh rvs, and , into two exponential rvs, and . The joint pdf of and can be expressed as [2]
[TABLE]
where . The parameter is, in fact, the correlation coefficient between exponential rvs and (see Section V). Also in this case, when , rvs and are statistically independent.
Population joint moments, , of rvs and are given by [9]
[TABLE]
III Problem Formulation
Assume that observations on rvs and are made in pairs, ; alternatively, observations, , may be made on Rayleigh rvs and . Then, pairs of observations are used to determine sample joint moments,
[TABLE]
corresponding, respectively, to population moments (8) or (4).
This Letter addresses two associated problems:
-
Given the pdf (7) and the constraint, , derive a lower bound on the error of an estimator of the correlation coefficient appearing in (7).
-
Make use of sample moments (9) to construct estimators of and examine their optimality with respect to the derived lower bound.
IV Lower Bounds on Estimation Errors
In the case of a bivariate exponential distribution (7), allowed values of the correlation coefficient are restricted to the -interval. If a statistic employed as an estimator of assumes values from a different, finite or infinite, interval, then the constraint, must be taken into account when establishing a lower bound on the estimator error.
IV-A Cramér-Rao Bound (CRB)
It is known [10] that under suitable regularity conditions, the variance of any unbiased estimator can be bounded by the lower Cramér-Rao bound (CRB). Therefore, the CRB is a useful measure when examining optimality of several competing estimators of a parameter of interest.
Let a vector of nonrandom parameters be defined by
[TABLE]
Then (neglecting any constraints on the parameters), the Fisher information matrix, , is a positive semidefinite symmetric matrix, comprising the elements
[TABLE]
Consequently, a lower bound on the variance of any unbiased estimator of can be determined from
[TABLE]
where is the inverse of .
Elements of the Fisher information matrix , for selected values of , are given in [11]. Values of the Cramér-Rao bound, shown in Table 1, have been determined by selecting a first diagonal element of the inverse of .
IV-B Mean-Square-Error (MSE) Bound
When the parameter space is restricted, the Cramér-Rao approach appears to be inadequate [12]–[14]. Therefore, to determine a lower bound on the error of an estimator of parameter in (7), knowledge of the finite-support constraint, , should be suitably combined with Fisher information contained in available data.
Consider an unbiased estimator of and let be a pdf of . Assume that the estimator is so constructed that values of its realizations (estimates) cannot exceed one. However, depending on a set of processed data, or , some estimates may assume negative, hence not allowed values.
Therefore, when such an aberrant estimate is observed, its value must be set to zero, and so modified estimate, , will assume the form
[TABLE]
Consequently, the pdf of the modified estimator will become a censored distribution [15],
[TABLE]
comprising a discrete probability mass and a continuous part. In (14), is an impulse (Dirac delta) function, is the probability that ,
[TABLE]
and denotes the Heaviside step function,
[TABLE]
Fig. 1 illustrates the effect of transforming the pdf of an estimator into its censored version, , when the value of the correlation coefficient being estimated decreases from to .
The mean-square error (MSE), , of the modified estimator can be expressed as
[TABLE]
In order to determine the lower MSE bound, assume that is a maximum-likelihood (ML) estimator. Since ML estimators are known to be asymptotically unbiased, efficient and Gaussian [10], let . The MS error of a modified estimator can be evaluated by exploiting moments of a censored Gaussian distribution [15].
Let be the pdf of a standard Gaussian rv , and its cumulative distribution function. Then, the MSE of the estimator can be expressed as follows
[TABLE]
where and .
The constrained error bound (17) differs from the CR bound (12), when is less than approximately . In the region, , the estimator becomes biased, and its MS error,
[TABLE]
remains below the CR bound. The bound reduction has resulted from incorporating knowledge of the constraint.
V Estimators of the Correlation Coefficient
Consider the population Pearson product-moment correlation coefficient defined by
[TABLE]
By inserting moments (8) into (19), it can be verified that . Therefore, the sample Pearson correlation coefficient, i.e. the statistic
[TABLE]
can be used to construct a censored estimate of as follows
[TABLE]
When the number of observations tends to infinity, sample moments converge to population moments, and the sample correlation coefficient will approach .
The use of sample correlation coefficient to estimate a population correlation coefficient is a standard practice. However, such an approach may not necessarily lead to an efficient estimator (an estimator whose variance attains the Cramér-Rao bound), especially in small or moderate sample sizes.
V-A Estimator Based on Correlation of Rayleigh Variables
Consider now the bivariate Rayleigh distribution (3) and the population Pearson correlation coefficient , given by a formula analogous to (19). The correlation coefficient can be expressed in terms of moments (4) as follows
[TABLE]
In this case, , only when or ; otherwise, is a nonlinear function of .
When , the sample correlation coefficient will approach (22). By employing the nonlinear transformation
[TABLE]
a censored estimate of is obtained as
[TABLE]
V-B Approximate Maximum-Likelihood Estimator
It has been shown [16] that in a case of highly correlated Rayleigh rvs, and when , an approximate ML estimator of the correlation coefficient is of the form, . The constraint, , can be removed by employing the geometric mean rather than the arithmetic mean. Consequently, the following statistic of cosine-similarity-squared is obtained
[TABLE]
The statistic (25) asymptotically converges to
[TABLE]
For and , the respective limits are and .
When the nonlinear transformation
[TABLE]
is applied, a censored estimate of assumes the form
[TABLE]
Owing to its origin, the estimator is expected to be asymptotically efficient, at least for larger values of .
V-C Performance of the Estimators
Computer simulations were employed to examine the performance of the three estimators, and , of the correlation coefficient . Three sample sizes, and , were chosen, somewhat arbitrarily, to represent the cases of small, moderate, and large sample sizes. Values of the correlation coefficient to be estimated varied from to , in steps of . For each combination of and , Monte Carlo experiment replications were carried out to determine the MS error, , for each of the three estimators.
Results of the study are shown in Fig. 2 along with the MSE bound (17) and the Cramér-Rao bound (12); values of the MSE bound are only shown when they differ from those of the CR bound.
The results can be summarized as follows:
-
The derived MSE lower bound is superior to the standard CRB when predicting errors of estimators of the correlation coefficient ; the MSE lower bound is more precise when the sample size is moderate or large.
-
When is greater than , the estimator is better than the other two estimators, and its estimated MS error, , differs only slightly from the derived lower bound.
-
In the region, , the estimator is superior to the estimator .
-
When , the estimator exhibits the smallest MS error; this observation supports the conclusion in [8] that the sample correlation coefficient (20) is an asymptotically most powerful test of the hypothesis against the alternative .
-
When , the MS error of the estimator markedly exceeds those of the other two estimators; this effect can partly be attributed to the approximate nature of the nonlinearity (27).
VI Conclusion
The non-negativity constraint has been incorporated into the standard CR bounding technique by utilizing moments of a censored Gaussian distribution. The resulting MSE bound establishes a lower bound on the MS error of any estimator of the correlation coefficient of exponentially distributed variables.
The simulation study has shown that MS errors associated with two of the examined estimators are close to the derived lower bound in two subintervals that jointly cover the entire (0,1)-interval. Each of the two estimators is a nonlinear function of a measure of either cosine similarity or centred cosine similarity (i.e. the sample correlation coefficient) between Rayleigh variables.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Balakrishnan and C. D. Lai, Continuous Bivariate Distributions , 2nd ed. New York, NY, USA: Springer, 2009.
- 2[2] R. K. Mallik, ”On multivariate Rayleigh and exponential distributions,” IEEE Trans. Inform. Theory , vol. IT-49, no. 6, pp. 1499–1515, Jun. 2003.
- 3[3] R. S. Calsaverini and R. Vicente, ”An information-theoretic approach to statistical dependence: Copula information,” Europhysics Lett. , vol. 88, no. 6, Dec. 2009, Art. no. 68003.
- 4[4] X. Liu, ”Copulas of bivariate Rayleigh and log-normal distributions,” Electron. Lett. , vol. 46, no. 25, pp. 1669–1671, Dec. 2010.
- 5[5] S. Miyabe, N. Ono and S. Makino, ”Estimating correlation coefficient between two complex signals without phase observation,” in Lecture Notes in Computer Science , vol. LNCS 9237, Vincent E. et al , Eds. Heidelberg: Springer, 2015, pp. 421–428.
- 6[6] M. F. Al-Saleh, and Y. A. Diab, ”Estimation of the parameters of Downton’s bivariate distribution using ranked set sampling scheme,” J. Statist. Plann. Inference , vol. 139. no. 2, pp. 277–286, Feb. 2009.
- 7[7] N. Balakrishnan, H. Keung and T. Ng, ”Improved estimation of the correlation coefficient in a bivariate exponential distribution,” J. Statist. Comput. Simul. , vol. 68, no. 2, pp. 173–184, 2001.
- 8[8] P. A. P Moran, ”Testing for correlation between non-negative variates,” Biometrika , vol. 54, no. 3/4, pp. 385–394, Dec. 1967.
