Generating an equidistributed net on a unit n-sphere using random rotations
Somnath Chakraborty, Hariharan Narayanan

TL;DR
This paper introduces a randomized algorithm that efficiently generates an $ ext{epsilon}$-net on an n-sphere, enabling accurate approximation of integrals of Lipschitz functions with high probability.
Contribution
It presents a novel randomized method for constructing an $ ext{epsilon}$-net on the sphere using a specific number of random rotations and word combinations, ensuring equidistribution.
Findings
Algorithm succeeds with high probability
Produces an $ ext{epsilon}$-net with controlled size
Effective for approximating integrals of Lipschitz functions
Abstract
We develop a randomized algorithm (that succeeds with high probability) for generating an -net in a sphere of dimension n. The basic scheme is to pick random rotations and take all possible words of length in the same alphabet and act them on a fixed point. We show this set of points is equidistributed at a scale of . Our main application is to approximate integration of Lipschitz functions over an n-sphere.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Advanced Combinatorial Mathematics · Topological and Geometric Data Analysis
Generating an equidistributed net on a unit -sphere using random rotations
Somnath Chakraborty
and
Hariharan Narayanan
School of Technology and Computer Science, Tata Institute of Fundamental Research, Mumbai 400005, India
Abstract.
We develop a randomized algorithm (that succeeds with high probability) for generating an -net in a sphere of dimension . The basic scheme is to pick random rotations and take all possible words of length in the same alphabet and act them on a fixed point. We show this set of points is equidistributed at a scale of . Our main application is to approximate integration of Lipschitz functions over an -sphere.
2000 Mathematics Subject Classification:
Primary 22D40
1. Introduction
In the present article, we develop a randomized algorithm (with high success probability) for the generation of an -net in an unit sphere of dimension . The basic scheme is to pick random rotations and take all possible words on length in the same alphabet and act them on a fixed point. We show this set of points is equidistributed at a scale of . The group can be identified with the three dimensional sphere , thus we obtain a scheme for producing an -net of , a task which is relevant to quantum computing in the context of the Solovay-Kitaev algorithm (although we do not address the issue of generating a sequence of elementary gates to efficiently approximate any given gate, in a non-exhaustive fashion). Our main application is to integration of Lipschitz functions over an -sphere. The net we produce, is -close in Hausdorff distance to the -sphere, and is also equidistributed in the following sense (with high probability): The uniform counting measure over the net is close to the uniform measure on the -sphere in -Wasserstein distance. This implies that the integral of every -Lipschitz function on with respect to is -close to its integral with respect to .
In [2], Alon and Roichman proved that, given any , there exists a such that for any finite group , and a random subset of order at least , the induced Cayley graph has small normalized second largest eigenvalue (in absolute value):
[TABLE]
Considering random walk on expander multigraphs, it follows that every element is an -word of length at most . For an irreducible representation , let be its dimension; let be the regular representation of , and . In [13], Russel and Landau proved that (1.1) holds for all random subsets of order at least
[TABLE]
This was obtained via an application of tail bounds for operator-valued random variables, as in Ahlswede and Winter [1], building upon the following observation: the normalized adjacency matrix of is the operator
[TABLE]
presented in terms of the standard basis of .
Now let be a compact Lie group, and a left-invariant Borel probability measure on . One considers the averaging operator , given by
[TABLE]
In [5], Bourgain and Gamburd established that if then has spectral radius when is finite algebraic subset generating a nonabelian free subgroup of . This has since been extended to all compact connected simple Lie groups by Benoist and de Saxcé in [3], where it was shown that has a spectral gap if and only if is almost diophantine. A corollary to the main result in [3] is the following: if is finitely-supported almost diophantine then the set of words in of fixed length approaches in Hausdorff distance.
In [15], a quantitative version of the spectral gap question was considered. It was shown that the Hausdorff distance between , a compact connected Lie group, and the subset of fixed length words on a random essentially small finite alphabet decays exponentially in the length of the words, with high probability. This was done via an analysis of the heat kernel with respect to a suitable finite dimensional subspace of and an application of tail bounds for operator-valued random variables. We note that the results of the present article are not implied by the results of [15], because the dimension of the Lie group is , and so the bounds from [15] for the length of the words and the number of generators, that apply for general compact Lie groups would be quadratic in rather than linear in . In the special case of the unitary group , such a result with a quadratic dependence on dimension for the length of the words was previously obtained by Hastings and Harrow in Theorem 5 of [10], however in their result the number of generators is specified in a indirect manner, whose dependence on is not obvious. On the other hand, the bounds obtained in the present work are linear in , both for the number of generators and the length of the words. In fact, for these parameters, the value of is close to the volumetric lower bound of on the size of an net of .
The two main results of this paper are stated below. The numberings correspond to their appearances in Sections and respectively. In the following statements, denotes a certain positive constant depending on . For a finite set , the -fold product consists of all -words of length inside the free group generated by .
**Theorem 3.15:
Let and ; let . Let consist of iid random points, drawn from the Haar measure on , where**
[TABLE]
and . Let be the (multi)set of all elements in and their inverses. Let ; if is sufficiently small then the probability that is an -net in is at least .
**Theorem 4.14:
For , let be the probability measure on corresponding to haar probability measure on the group of rotations . Let be sufficiently small and . Let be a random subset such that satisfies**
[TABLE]
where . Let be the (multi)set of all elements in and their inverses. Let and let be the probability measure on , uniformly supported on , where
[TABLE]
If is sufficiently small, then the following inequality holds with probability at least :
[TABLE]
Here is used to denote the Wasserstein distance between two measures supported on .
For the remainder of this section, we assume given a real number model of computation, in which only standard algebraic operations are allowed on Gaussian random vectors, but bits are not manipulated. Thus, for and as described in Theorem 4.14, choose a set consisting of orthogonal matrices, each chosen independently from the Haar measure of . Consider all words of length in these generators and their inverses. Apply the resulting matrices to the vector . Then these points form an equidistributed net, that can be used for integrating a Lipschitz to within an additive error of . If we assume an oracle that outputs independent dimensional Gaussian random vectors when queried, then the whole process requires only queries to this oracle. Note that the obvious procedure of producing an equidistributed net, would require calls to the Gaussian oracle (which would then be normalized to lie on the sphere). The latter method uses exponentially more randomness than our procedure using random rotations.
Acknowledgments
SC would like to thank Sandeep Juneja and Jaikumar Radhakrishnan for helpful conversations. HN was partially supported by a Ramanujan fellowship.
2. Application of Spherical Harmonics
This section briefly reviews the basics of harmonic analysis on the unit -sphere . The two lemmas in this section will be used in an essential manner in deriving the computations in this section.
Let denote the standard euclidean surface probability measure on . For any Borel set , if then
[TABLE]
where is the standard Lebesgue measure on and is the unit disk centered at origin, so that . We recall that the Lebesgue measure of the unit -sphere in is
[TABLE]
Now, the fact that is - invariant follows from usual rotation invariance of Lebesgue measure on . Hence (by unimodularity of the compact Lie group of rotations of ), the measure is the unique probability measure on induced by the haar measure on . For , let be the negative of the Laplace-Beltrami operator on . Thus, given , one has
[TABLE]
where is defined by . It is well-known that the Hilbert-product space decomposes into a direct sum of the eigenspaces of , in the sense that the -closure of the direct sum is :
[TABLE]
Recall that is the space of degree- homogeneous harmonic polynomials in variables, restricted to ; the dimension of is
[TABLE]
and the corresponding eigenvalue is . Note that, for any , one has
[TABLE]
One has
[TABLE]
Moreover, notice that, when , one has
[TABLE]
Equivalently, writing for the eigenspace corresponding to eigenvalue , one gets
[TABLE]
and for all , the inequality implies
[TABLE]
where, the sum ranges over all eigenvalues in . Fix a point . Let be the heat kernel on , corresponding to Brownian motion started at . That is, is the fundamental solution to the problem
[TABLE]
where the convergence is taken to be in the topology. A Brownian motion on , started at has infinitesimal generator . Fixing orthonormal basis of for each , one has
[TABLE]
where the last equality is due to addition theorem of spherical harmonics (see theorem 2.26 of [14]) that correspond to the usual addition formula for trigonometric functions when ; here denotes the Legendre polynomial of degree and dimension ; in explicit terms, this polynomial is
[TABLE]
where the coefficients are given by
[TABLE]
One has
[TABLE]
where the last equation is a well-known properties of Legendre polynomials (see theorem 2.29 of [14]). We note that by theorem 2.29 of [14], one has
[TABLE]
for all . It is known that for all .
For , let be defined by
[TABLE]
Lemma 2.7**.**
Suppose that and for any , let where
[TABLE]
then the following inequality holds:
[TABLE]
Proof.
When , one has . For , write
[TABLE]
For , one has
[TABLE]
Suppose that an integer satisfies
[TABLE]
Then
[TABLE]
Consider the inequality
[TABLE]
By monotone property of the logarithm function, the following inequality is equivalent to (2.9) above:
[TABLE]
For , the function
[TABLE]
satisfies , has global minima , and is increasing. Since and , the condition is satisfied; because , the following inequality implies (2.10):
[TABLE]
We claim that, for and , the following inequality holds:
[TABLE]
This is equivalent to
[TABLE]
Writing , this is equivalent to . The function has derivative , which is increasing for , and . This proves the claim.
Thus, implies
[TABLE]
∎
Remark 2.12*.*
If and , one has
[TABLE]
We write
[TABLE]
Then the lemma above implies
[TABLE]
for where .
Lemma 2.13**.**
Let . For all and all , following inequality holds:
[TABLE]
Proof.
For any , addition theorem implies
[TABLE]
The function , for , satisfies
[TABLE]
which shows that . This yields
[TABLE]
If then , which implies that, for where , and any , one has
[TABLE]
∎
3. Hausdorff distance
We recall the following definition:
Definition 3.1**.**
Given a subset , and , let be the union of all -neighbourhoods of points in ; the Hausdorff distance is defined to be
[TABLE]
Our analysis in this section will be based on an application of the following theorem, first appeared in [1].
Theorem 3.2** (Ahlswede-Winter).**
Let be a finite dimensional Hilbert space, with . Let be independent identically distributed random variables taking values in the cone of positive semidefinite operators on , such that for some , and . Then, for all , the following holds:
[TABLE]
Let be a non-empty subset, with . For , let
[TABLE]
Recall (inequality 2.3) that . Because is -invariant, the subspace is invariant under the operators
[TABLE]
Due to rotation invariance of the surface probability measure , the operators turns out to be self-adjoint. Positive semidefiniteness of follows from the identity
[TABLE]
Moreover, writing for the unique right-invariant Haar (probability) measure on , one has
[TABLE]
Writing for the map , one has
[TABLE]
where . Since is rotation-invariant measure on , one has . Because , one has
[TABLE]
Therefore, , which makes
[TABLE]
Furthermore, the operators are positive semidefinite for all , because
[TABLE]
Theorem 3.6**.**
Let be a set of order , chosen independently, and uniformly at random from the Haar measure on and let be the (multi)set of all elements in and their inverses. Let satisfy
[TABLE]
where . Let where . For
[TABLE]
and any integer satisfying , the following inequality holds:
[TABLE]
Proof.
Since and , one has . Setting and in (3.3) yields
[TABLE]
Therefore, for all , the following inequality holds:
[TABLE]
In particular, writing , one has
[TABLE]
Iterating this inequality times fetches
[TABLE]
Let ; then
[TABLE]
Hence, using lemma 2.7 and 2.13 in inequality (3.8), one derives
[TABLE]
Since and is inverse-symmetric, this produces (3.7). ∎
The following theorem has appeared in [16]:
Theorem 3.9** (Nowak-Sjögren-Szarek).**
Let be the heat kernel on the sphere , corresponding to Brownian motion initiated at . Let and fix . Let be the Riemannian distance, so that . Then, for all , the inequality
[TABLE]
holds for some constants depending only on and .
Let and let be such that . Then, taking in the above upperbound, one has
[TABLE]
Notice that the constant is independent of the initial point . Letting , we have
[TABLE]
Lemma 3.11**.**
Let where . If is sufficiently small, then the following inequality implies that is an -net:
[TABLE]
Proof.
Let ; then, for any , and all satisfying the inequality
[TABLE]
it follows from (3), and positivity of the heat kernel, that , and (hence)
[TABLE]
Write for the Riemannian disk of radius , centered at . Let denote the unit euclidean ball; one has (see [8])
[TABLE]
Here “vol” denotes the standard Lebesgue volume. Now suppose, if possible, that (3.12) is satisfied, and yet, is not an -net, so that there is such that . Writing
[TABLE]
we derive from (3.12) and (3.13)
[TABLE]
which produces
[TABLE]
Considering (3), this is impossible if is sufficiently small. ∎
Theorem 3.15**.**
Let be small, and . Let consist of iid random points, drawn from the Haar measure on , where
[TABLE]
with . Let ; if is sufficiently small then the probability that is an -net in is at least .
Proof.
One sees by the remark 2.12 (following lemma 2.7) that for any , if
[TABLE]
and , then the following inequality holds:
[TABLE]
Let , so that for sufficiently large (to be determined à la theorem 3.2) the parameter ensures
[TABLE]
Taking logarithm of (3.16), we find that it suffices to take
[TABLE]
Suppose that is large enough so that ; for this to be true, we require . We enforce the inequality by requiring
[TABLE]
Threfore, if , then holds. Since is small, one has . Thus, by theorem 3.6, the following inequality holds:
[TABLE]
The proof is complete by lemma 3.11.
∎
4. Equidistribution and Wasserstein Distance
Let be a compact connected metric space. Let be the Banach space of continuous functions on , and its dual — consisting of linear functionals on — equipped with topology; recall that, by compactness of , every linear functional is bounded, and hence, continuous. Let be the space of all finite Borel measures on . By Reisz-Markov theorem, there is a bijection , defined by
[TABLE]
that is closed under addition and scalar multiplication. The space inherits the sequential topology on via this bijection. Thus, one says if and only if
[TABLE]
for every . Since the Lipschitz functions are dense in , it suffices to consider only the 1-Lipschitz functions in the above limit.
For probability measures , the Prokhorov distance is defined to be
[TABLE]
where . This gives a metric on the convex subspace of probability measures on , and — by Prokhorov’s theorem — the induced metric topology on is the subspace of the weak topology on ; moreover, the space is compact.
We recall that, in a metric space , the 1-Wasserstein distance between two regular Borel probability measures and on is defined to be
[TABLE]
where is the space of couplings of and ; that is, is the space of all regular Borel probability measures on such that the following holds: if and only if for every Borel set , one has
[TABLE]
Let be the space of all 1-Lipschitz functions on ; we recall that, for any , one says if and only if for all . The following duality theorem first appeared in [12].
Theorem 4.1** (Kantorovič - Rubinšteín).**
For any , the following equality holds:
[TABLE]
Definition 4.3**.**
Let . Let be a Borel probability measure on the metric space . A finite nonempty subset is said to be strongly -equidistributed if the following inequality holds:
[TABLE]
As mentioned before, for to be strongly -equidistributed, it suffices to have a constant such that
[TABLE]
Below we show strong -equidistribution of a subset of of appropriate size and low degree of randomness.
The following lemma will be useful in course of proving the main theorem of this subsection.
Lemma 4.6** (Fourier convergence).**
Fix , and let ; then the Fourier-Laplace expansion of converges uniformly to in .
Proof.
Fix an orthonormal basis for the eigenspace . Then the Fourier-Laplace expansion of the heat kernel based at is
[TABLE]
Write
[TABLE]
so that
[TABLE]
One has
[TABLE]
by rotation invariance of the sum (see Lemma 2.19 and 2.29, [14])
[TABLE]
Therefore,
[TABLE]
and since
[TABLE]
this forces
[TABLE]
The Weierstrass’ test implies uniform convergence of , to a continous function on ; the claim follows by uniqueness of the continuous limit. ∎
Lemma 4.8**.**
Let be the metric distance on . Let be the uniform surface probability measure on . For all , one has
[TABLE]
Proof.
Without loss of generality we may assume that is embedded in as the unit sphere with center at , and . We write , and let be the Borel measure whose Radon-Nikodym derivative is
[TABLE]
Let be a standard Brownian motion on with infinitesimal generator . For each positive integer , consider the equi-partition
[TABLE]
where for . Now define as follows:
[TABLE]
By linearity of expectation, for any integer one has
[TABLE]
Fix a realization of the Brownian motion . For integer , we consider the tangent space . Write
[TABLE]
In explicit terms, one has
[TABLE]
Orthogonality relations such as
[TABLE]
are immediate; moreover, one has
[TABLE]
where is the unit normal to pointing inward. From the inequalities
[TABLE]
one has . Hence,
[TABLE]
Suppose and in . Let be such that
[TABLE]
Since the function
[TABLE]
takes arbitrarily small positive values, such a point — that satisfies (4.10) — exists by continuity of and compactness of . Note that
[TABLE]
by independence of increments for Brownian motion on euclidean space. From
[TABLE]
we derive
[TABLE]
It thus suffices to prove
Lemma 4.11**.**
[TABLE]
Proof.
We will use the stereographic projection of onto . It can be shown (see for example [7]) that the image of a standard Brownian motion on via the stereographic projection onto , where , with being the Laplacian on , has an infinitesimal generator that satisfies
[TABLE]
Applying this to the function , we see that
[TABLE]
It follows that for any ,
[TABLE]
converges as to , proving the lemma. ∎
∎
Corollary 4.12**.**
Let and for integers , let be the Legendre polynomial of degree and dimension . Let and
[TABLE]
Then the following inequality holds for all :
[TABLE]
Proof.
We recall the Hecke-Funk formula: for any function , which satisfies the inequality
[TABLE]
and any eigenfunction and point one has
[TABLE]
Consider the function , taking values in ; this satisfies the hypothesis in Hecke-Funk formula, and since , one gets
[TABLE]
Consider the Fourier- Laplace expansion of the heat kernel, as in lemma 4.6 above. By uniform convergence (lemma 4.6) of the Fourier-Laplace expansion of the heat kernel, one has
[TABLE]
Note that
[TABLE]
for all . Hence, Lemma 4.8 together with Hölder inequality implies
[TABLE]
∎
Theorem 4.14**.**
For , let be the probability measure on corresponding to . Let be sufficiently small and . Let be a random subset such that satisfies the inequality in theorem 3.15, namely
[TABLE]
where . Let and let be the uniform probability measure on , supported on , where as before and
[TABLE]
Then, with probability at least , the following inequality holds:
[TABLE]
Proof.
Let be the set of mean-zero -functions on . By theorem 4.1, it suffices to show that
[TABLE]
For any such function , if then
[TABLE]
For sufficiently small , we let be the Borel probability measure on whose density is
[TABLE]
Then, for , one has
[TABLE]
with probability at least .
For any function , define to be
[TABLE]
From uniform convergence of the Fourier-Laplace expansion of heat-kernel, it follows that ; hence, putting , one has
[TABLE]
Moreover,
[TABLE]
by lemma 4.8 and Hölder inequality applied to . Therefore, for sufficiently small, equations (4.16), (4), and (4) yield
[TABLE]
∎
5. Conclusion
We proved two results about the finite time behavior of a random Markov Chain on the sphere whose transitions correspond to rotations chosen uniformly at random. The first result states that for random rotations and , if one takes the image of the north pole on the sphere under all possible words of length in the alphabets and their inverses, one obtains an net with high probability. For these parameters, the value of is close to the volumetric lower bound of on the size of an net of . Secondly, we show that this net is equidistributed with probability at least in the sense that the Wasserstein distance of the uniform measure on the net is within of the uniform measure on .
These results can respectively be applied to approximately minimize a Lipschitz function on the sphere (by evaluation on the net) and in to approximately integrate a Lipschitz function on the sphere. In both cases the approximation is within an additive of the true value.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Ahlswede and A. Winter, Strong converse for identification via quantum channels , IEEE Trans. Inform. Theory 48 (2002), no. 3, 569–579.
- 2[2] N. Alon, and Y. Roichman, Random Cayley Graphs and Expanders , Random Structures Algorithms 5 (1994), no. 2, 271–284.
- 3[3] Y. Benoist, and N. de Saxcé, A Spectral Gap Theorem in Simple Lie Groups , Invent. Math. 205 (2016), no. 2, 337–361.
- 4[4] J. Bourgain, On random walks in large compact Lie groups , Geometric aspects of functional analysis, 55–63, Lecture Notes in Math. (2169), Springer, Cham, 2017
- 5[5] J. Bourgain, and A. Gamburd, A spectral gap theorem in S U ( d ) 𝑆 𝑈 𝑑 SU(d) , J. Eur. Math. Soc. (JEMS) 14 (2012), no. 5, 1455–1511.
- 6[6] by same author, On the spectral gap for finitely-generated subgroups of S U ( 2 ) 𝑆 𝑈 2 SU(2) , Invent. Math. 171 (2008), no. 1, 83–121.
- 7[7] T. Carne, Brownian motion and stereographic projection , Ann. Inst. H. Poincaré Probab. Statist. 21 (1985), no. 2, 187–196.
- 8[8] A. Gray, The volume of a small geodesic ball of a Riemannian manifold , Michigan Math. J. 20 (1973), 329–344 (1974).
