
TL;DR
This paper demonstrates that certain subsets of spheres with limited antipodal pairs exhibit strong negative type, enabling unique probability measure determination and consistent statistical tests based on expected distances.
Contribution
It establishes that subsets of spheres with at most one antipodal pair have strong negative type, extending the understanding of negative type properties and their implications for statistical testing.
Findings
Subsets with at most one antipodal pair have strong negative type.
Expected distances uniquely determine probability measures on these sets.
Distance covariance tests are consistent against all alternatives in such sets.
Abstract
It is known that spheres have negative type, but only subsets with at most one pair of antipodal points have strict negative type. These are conditions on the (angular) distances within any finite subset of points. We show that subsets with at most one pair of antipodal points have strong negative type, a condition on every probability distribution of points. This implies that the function of expected distances to points determines uniquely the probability measure on such a set. It also implies that the distance covariance test for stochastic independence, introduced by Sz\'ekely, Rizzo and Bakirov, is consistent against all alternatives in such sets. Similarly, it allows tests of goodness of fit, equality of distributions, and hierarchical clustering with angular distances. We prove this by showing an analogue of the Cram\'er--Wold theorem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\beginpackages\usepackage
url \usepackagecolor \endpackages
Strong Negative Type in Spheres
by Russell Lyons
Abstract. It is known that spheres have negative type, but only subsets with at most one pair of antipodal points have strict negative type. These are conditions on the (angular) distances within any finite subset of points. We show that subsets with at most one pair of antipodal points have strong negative type, a condition on every probability distribution of points. This implies that the function of expected distances to points determines uniquely the probability measure on such a set. It also implies that the distance covariance test for stochastic independence, introduced by Székely, Rizzo and Bakirov, is consistent against all alternatives in such sets. Similarly, it allows tests of goodness of fit, equality of distributions, and hierarchical clustering with angular distances. We prove this by showing an analogue of the Cramér–Wold theorem.
††2010 Mathematics Subject Classification. Primary 51K99, 51M10, 45Q05, 44A12. Secondary 62H20, 62G20, 62H15, 62H30. ††Key words and phrases. Cramér–Wold, hemispheres, expected distances, distance covariance, equality of distributions, goodness of fit, hierarchical clustering.††Research partially supported by NSF grant DMS-1612363.
§1. Introduction.
We introduce the topic by borrowing from [MR3421595???].
Let be a metric space. One says that has negative type if for all and all lists of red points and blue points in , the sum of the distances between the ordered pairs of points of opposite color is at least the sum \sum_{i,j}{\mathchoice{\hbox{\displaystyle\left(\vbox to8.50006pt{}\right.\n@space}}{\hbox{\textstyle\left(\vbox to8.50006pt{}\right.\n@space}}{\hbox{\scriptstyle\left(\vbox to5.95004pt{}\right.\n@space}}{\scriptscriptstyle\hbox{\scriptstyle\left(\vbox to4.25003pt{}\right.\n@space}}}d(x_{i},x_{j})+d(x^{\prime}_{i},x^{\prime}_{j}){\mathchoice{\hbox{\displaystyle\left)\vbox to8.50006pt{}\right.\n@space}}{\hbox{\textstyle\left)\vbox to8.50006pt{}\right.\n@space}}{\hbox{\scriptstyle\left)\vbox to5.95004pt{}\right.\n@space}}{\scriptscriptstyle\hbox{\scriptstyle\left)\vbox to4.25003pt{}\right.\n@space}}} of the distances between the ordered pairs of points of the same color. It is not obvious that euclidean space has this property, but it is well known. By considering repetitions of and taking limits, we arrive at a superficially more general property: For all , , and with , we have
[TABLE]
We say that has strict negative type if, for every and all -tuples of distinct points , equality holds in (1.1) only when for all . Again, euclidean spaces have strict negative type. A simple example of a metric space of non-strict negative type is on a 2-point space, i.e., with the -metric.
A (Borel) probability measure on has finite first moment if for some (hence all) ; write for the set of such probability measures. Suppose that . By approximating by probability measures of finite support, we obtain a yet more general property, namely, that when has negative type,
[TABLE]
We say that has strong negative type if it has negative type and equality holds in (1.2) only when . See [MR3813995(author)???] ([MR3813995(year)???]) for an example of a (countable) metric space of strict but not strong negative type. The notion of strong negative type was first defined by [MR1163396???]. [Lyons:dcov???] used it to show that a metric space has strong negative type iff the theory of distance covariance holds in just as in euclidean spaces, as introduced by [MR2382665???]. Similarly, it allows tests of goodness of fit, equality of distributions, and hierarchical clustering with angular distances: see the review in [SZ:energy???]. [Lyons:dcov???] noted that if has negative type, then has strong negative type when .
Define
[TABLE]
for and . [Lyons:dcov???] remarked that if has negative type, then the map is injective on iff has strong negative type. (There are also metric spaces not of negative type for which is injective.)
A list of metric spaces of negative type appears as Theorem 3.6 of [Meckes:pdms???]. All euclidean spaces have strong negative type; see [Lyons:dcov???] for a discussion of various proofs.
That real and complex hyperbolic spaces have negative type was shown by [MR0215331???], Sec. 4, and was made explicit by [MR0365042???], Corollary 7.4; that they have strict negative type was shown by [MR1855636???]. [MR3421595???] showed that real hyperbolic spaces have strong negative type. The remaining constant-curvature, simply connected spaces are spheres.
Let denote the unit-radius sphere centered at the origin of . Although spheres have negative type (in their intrinsic metric), not even circles have strict negative type. For example, in , take two red points \bigl{\{}(1,0),(-1,0)\bigr{\}} and two blue points \bigl{\{}(0,1),(0,-1)\bigr{\}}. Nevertheless, antipodal symmetry is the only obstruction to strict negative type: the main result, Theorem 9.1, of [MR1484084???] is that a subset of a sphere has strict negative type iff that subset contains at most one pair of antipodal points. We strengthen this to strong negative type:
Theorem 1.1. If contains at most one pair of antipodal points, then has strong negative type.
We begin by proving a special case:
Theorem 1.2. If is an open hemisphere, then has strong negative type.
We may parametrize open hemispheres as
[TABLE]
for . A crucial ingredient in the proof of Theorem 1.2 is an analogue of the Cramér–Wold theorem:
Theorem 1.3. Let be an open hemisphere in an -dimensional sphere, . For a finite signed measure on and , define . The map is injective. Moreover, if is a dense subset of , then is injective.
Let be the reflection in the origin. If is a Borel subset of such that and its image under partition and such that the interior of is a hemisphere, then call a partitioning hemisphere. Given a probability measure on , let denote the maximal measure that is invariant under and such that . Note that if is a probability measure on that is invariant under , then for every partitioning hemisphere, . Therefore, for every probability measure on with , there is a probability measure such that for every partitioning hemisphere, . Moreover, if but for every -dimensional great sphere in , then there is another probability measure such that for every open hemisphere, . We extend Theorem 1.3 to show the converse, which we use to prove Theorem 1.1:
Theorem 1.4. Let be a probability measure on such that . If is a probability on such that for every partitioning hemisphere, , then . Similarly, if is a probability on such that for every belonging to a dense subset of , then .
The last assertion of Theorem 1.4 is essentially known: see, e.g., Lemmas 2.3 and 2.4 of [MR1753484???].
We also have the following fact:
Proposition 1.5. If is a probability measure on such that for every partitioning hemisphere, , then is -invariant. Similarly, if is a probability measure on such that for a dense set of , then is -invariant.
The last assertion is again essentially known: see Korollar 3.2 of [MR0275286???]. The work of [MR1753484???] and [MR0275286???], as well as other authors who study related questions, uses spherical harmonics. This is a powerful tool that leads to more general results, although those extensions do not seem relevant to negative type. We give elementary proofs that rely only on the Cramér–Wold theorem for euclidean spaces:
Theorem 1.6. If is a complex Borel measure on such that for every open halfspace in , then .
*Proof. *For , define on by
[TABLE]
Then , whence its Fourier transform satisfies for all . Because , it follows that the Fourier transform of also vanishes, whence so does .
Even this theorem can be proved without Fourier analysis—see [Walther:withAdd???] or [MR3750244???].
§2. Proofs.
Proof of Theorem 1.3. By the bounded convergence theorem, determines for all such that \mu\bigl{(}\partial(H\cap H_{t})\bigr{)}=0, and therefore determines all of by continuity from below: for every , there are such that \mu\bigl{(}\partial(H\cap H_{s_{k}})\bigr{)}=0 and increase to .
We may take to be the upper open hemisphere, \bigl{\{}(t_{1},t_{2},\ldots,t_{n+1})\in S^{n}\,;\;t_{n+1}>0\bigr{\}}. Define by
[TABLE]
Then is a homeomorphism from to the affine hyperplane H^{\prime}\mathrel{\mathop{\mathchar 58\relax}}=\bigl{\{}(t_{1},t_{2},\ldots,t_{n+1})\in{{R}}^{n+1}\,;\;t_{n+1}>0\bigr{\}}, namely, is the intersection of with the line through the origin and . Furthermore, maps to an open halfspace in and every open halfspace in is the image under of some . Therefore, determines the measures of all open halfspaces with respect to the pushforward on . The classical theorem of Cramér and Wold applied to shows that this determines , which in turn determines .
Proof of Theorem 1.2. Write for the volume measure on normalized to have mass . Then for all , we have
[TABLE]
This well-known fact is easy to see: By rotation-invariance of , the right-hand side depends only on . By considering three points on a great circle, we find that the dependence is linear. Finally, by taking antipodal points, we verify that the constant of linearity is 1.
Therefore, if and are probabilities on , we may write
[TABLE]
Expanding the square in the integrand and using the facts that
[TABLE]
and
[TABLE]
for any finite signed measure, , we obtain that
[TABLE]
It is evident from this that has negative type. In order to prove has strong negative type, it suffices to show that if and are concentrated on and satisfy for -a.e. , then . But this is immediate from Theorem 1.3.
Given any signed measure , define the antisymmetric measure , where is the pushforward of by . For positive with , we have , the positive part of . For positive without assuming that , we have
[TABLE]
Lemma 2.1. Let and be probability measures on . If for every partitioning hemisphere, , then . Similarly, if for every , where is a dense subset of , then .
*Proof. *We claim that there is an -dimensional great sphere in with . To see this, we build inductively by dimension. First, because only countably many points have positive mass, there is a pair of antipodal points with . Second, all uncountably many -dimensional great spheres in that contain have pairwise intersections exactly , whence there is a -dimensional great sphere with . We may continue this procedure recursively, finding a -dimensional great sphere for with . Finally, take .
Let be one of the two open hemispheres comprising . Note that under either assumption (in the second case, we use a continuity argument like that at the start of the proof of Theorem 1.3).
Let be a partitioning hemisphere. Because and , we have
[TABLE]
A similar equation holds for . Hence, the assumption that for every partitioning hemisphere, , yields
[TABLE]
for every such .
Now every set is of the form for some partitioning hemisphere, . It follows that
[TABLE]
for every , whence by Theorem 1.3, it follows that .
We now prove the second assertion of the lemma. Note that in the preceding proof, we did not use the full strength of the assumption that for every partitioning hemisphere, , but only that for partitioning hemispheres satisfying for ; we may also require that . Let be such that , and let be on the geodesic segment from to with , , and . (Such exist because .) Let converge to as . By the bounded convergence theorem, and similarly for , whence . In addition, we have and similarly for . Hence, for every , whence .
Proof of Theorem 1.4. By Lemma 2.1, either assumption implies that . We may conclude from (2.1) that , whence again from (2.1), that . Since also , we obtain the desired conclusion, .
Proof of Theorem 1.1. If contains no antipodal points, then every concentrated on has , whence the proof that has strong negative type is exactly as for Theorem 1.2, using Theorem 1.4 in place of Theorem 1.3.
If contains one antipodal pair, , then it still suffices to show that for probabilities and concentrated on , the assumption for a dense set of implies . By Lemma 2.1, such an assumption yields . Because and is a positive measure with , and similarly for , we obtain and . Therefore, for a dense set of . Because and are supported by , it follows that , and so , as desired.
Proof of Proposition 1.5. For both assertions, we may apply Lemma 2.1 to the pair of measures and , getting , whence . Thus, , as desired.
Acknowledgement. I thank Marcos Matabuena for asking me about strong negative type for the angular metric on compositional data, i.e., on the probability simplex.
REFERENCES
Faraut, J. and Harzallah, K. (1974). Distances hilbertiennes invariantes sur un espace homogène. Ann. Inst. Fourier (Grenoble), 24(3), xiv, 171–217.
Gangolli, R. (1967). Positive definite kernels on homogeneous spaces and certain stochastic processes related to Lévy’s Brownian motion of several parameters. Ann. Inst. H. Poincaré Sect. B (N.S.), 3, 121–226.
Hjorth, P., Lisonĕk, P., Markvorsen, S., and Thomassen, C. (1998). Finite metric spaces of strictly negative type. Linear Algebra Appl., 270, 255–273.
Hjorth, P.G., Kokkendorff, S.L., and Markvorsen, S. (2002). Hyperbolic spaces are of strictly negative type. Proc. Amer. Math. Soc., 130(1), 175–181 (electronic).
Lyons, R. (2013). Distance covariance in metric spaces. Ann. Probab., 41(5), 3284–3305.
Lyons, R. (2014). Hyperbolic space has strong negative type. Illinois J. Math., 58(4), 1009–1013.
Lyons, R. (2018). Errata to “Distance covariance in metric spaces” [MR3127883]. Ann. Probab., 46(4), 2400–2405.
Lyons, R. and Zumbrun, K. (2018). A calculus proof of the Cramér–Wold theorem. Proc. Amer. Math. Soc., 146(3), 1331–1334.
Meckes, M.W. (2013). Positive definite metric spaces. Positivity, 17(3), 733–757.
Rubin, B. (1999). Inversion and characterization of the hemispherical transform. J. Anal. Math., 77, 105–128.
Schneider, R. (1970). Über eine Integralgleichung in der Theorie der konvexen Körper. Math. Nachr., 44, 55–75.
Székely, G.J. and Rizzo, M.L. (2017). The energy of data. Annu. Rev. Stat. Appl., 4, 447–479.
Székely, G.J., Rizzo, M.L., and Bakirov, N.K. (2007). Measuring and testing dependence by correlation of distances. Ann. Statist., 35(6), 2769–2794.
Walther, G. (1997). On a conjecture concerning a theorem of Cramér and Wold. J. Multivariate Anal., 63(2), 313–319. Addendum, J. Multivariate Anal. 67(2) (1998), 431.
Zinger, A.A., Kakosyan, A.V., and Klebanov, L.B. (1992). A characterization of distributions by mean values of statistics and certain probabilistic metrics. J. Soviet Math., 59(4), 914–920. Stability problems for stochastic models. Translated from Problemy Ustoichivosti Stokhasticheskikh Modelei, Trudy Seminara, 1989, pp. 47–55.
Department of Mathematics, 831 E. 3rd St., Indiana University, Bloomington, IN 47405-7106
