Rotatable random sequences in local fields
Steven N. Evans, Daniel Raban

TL;DR
This paper extends classical results on rotatable sequences and positive definite functions from real numbers to local fields, using probabilistic methods to establish analogues of Freedman's and Schoenberg's theorems.
Contribution
It introduces a local field analogue of Freedman's theorem for rotatable sequences and derives a corresponding version of Schoenberg's classical result.
Findings
Established a probabilistic proof of the local field analogue of Freedman's theorem.
Derived a local field version of Schoenberg's theorem on positive definite functions.
Obtained a local field counterpart of the Maxwell-Poincaré-Borel observation on sphere distributions.
Abstract
An infinite sequence of real random variables is said to be rotatable if every finite subsequence has a spherically symmetric distribution. A celebrated theorem of Freedman states that is rotatable if and only if for all , where is a sequence of independent standard Gaussian random variables and is an independent nonnegative random variable. Freedman's theorem is equivalent to a classical result of Schoenberg which says that a continuous function with is completely monotone if and only if given by is nonnegative definite for all . We establish the analogue of Freedman's theorem for sequences of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Rotatable Random Sequences in Local Fields
Steven N. Evans
Department of Statistics #3860
367 Evans Hall
University of California
Berkeley, CA 94720-3860
USA
and
Daniel Raban
UCLA Mathematics Department
Box 951555
Los Angeles, CA 90095-1555
USA
Abstract.
An infinite sequence of real random variables is said to be rotatable if every finite subsequence has a spherically symmetric distribution. A celebrated theorem of Freedman states that is rotatable if and only if for all , where is a sequence of independent standard Gaussian random variables and is an independent nonnegative random variable. Freedman’s theorem is equivalent to a classical result of Schoenberg which says that a continuous function with is completely monotone if and only if given by is nonnegative definite for all . We establish the analogue of Freedman’s theorem for sequences of random variables taking values in local fields using probabilistic methods and then use it to establish a local field analogue of Schoenberg’s result. Along the way, we obtain a local field counterpart of an observation variously attributed to Maxwell, Poincaré, and Borel which says that if is uniformly distributed on the sphere of radius in , then, for fixed , the distribution of converges to that of a vector of independent standard Gaussian random variables as .
Key words and phrases:
-adic; -series; exchangeable; spherically symmetric; Gaussian; total variation; completely monotone; nonnegative definite
2010 Mathematics Subject Classification:
primary 60B99; 60G09; secondary 12J25
SNE supported in part by NSF grant DMS-1512933 and NIH grant 1R01GM109454-01.
1. Introduction
An -vector of real-valued random variables is rotatable if has the same distribution as for every orthogonal matrix ; that is, the distribution of is spherically symmetric. Similarly, an infinite sequence of real-valued random variables is rotatable if the vectors are rotatable for every (here we use the notation ). Because permutation matrices are orthogonal, it follows that a rotatable infinite sequence is exchangeable and hence, by de Finetti’s theorem, distributed as a mixture of independent, identically distributed sequences. A famous result of Maxwell [Max75, Max78] says that if a real random vector is spherically symmetric and has independent (necessarily identically distributed) entries, then the distribution of the entries is centered Gaussian. Combining these two observations makes plausible the celebrated theorem of Freedman [Fre62] (see, also, [Kel70, Kin72, Eat81, Let81, Smi81] that a rotatable infinite sequence of real-valued random variables is a scale mixture of sequences of independent standard Gaussian random variables; that is, if is an infinite rotatable sequence, then , where is a sequence of independent standard Gaussian random variables and is a nonnegative random variable that is independent of .
We refer the reader to the Historical and Bibliographical Notes in [Kal05] for an indication of the later literature around Freedman’s theorem and for remarks on the connection between this result and the classical theorem of Schoenberg [Sch38] which says that a continuous function with is completely monotone if and only if given by is nonnegative definite for all .
Our primary goal, realized in Theorem 3.3, is to obtain an analogue of Freedman’s theorem for sequences of random variables taking values in fields other than the fields of real and complex numbers. More specifically, we consider the local fields; a local field is any locally compact, non-discrete topological field other than or (a local field is, for some prime , a finite algebraic extension of either the field of -adic numbers or the field of Laurent series over the finite field of integers modulo ). We recall some facts about the structure of local fields and vector spaces over them in Section 2. Just as in the real case, Freedman’s theorem is equivalent to a structure theorem for nonnegative definite functions, and we present such a result in Section 4.
In order to search for a counterpart Freedman’s theorem we need to have a parallel for the group of orthogonal matrices so that we can say what it means for a distribution to be “spherically symmetric” in the local field setting. The orthogonal matrices are, of course, the matrices that preserve the Euclidean metric on -dimensional space and so, denoting our local field by , we take as our parallel of the orthogonal matrices those matrices that are isometries of , where is equipped with the natural metric described in Subsection 2.2 – see Subsection 2.3.
Not surprisingly, our counterpart of Freedman’s theorem also involves a suitable parallel of the class of Gaussian measures in a local field setting. Such an analogue was considered in [Eva89, Eva01]. The idea is that, given one has a notion of spherical symmetry, one can take the property embodied in Maxwell’s theorem to be the definition of Gaussianity for local fields. We recall some of the elementary properties of Gaussian probability measures on local fields in Subsection 2.6 after we have laid some groundwork on Haar measure and Fourier theory on local fields in Subsection 2.4 and Subsection 2.5, respectively.
2. Local fields
From now on, let be a fixed local field. We refer the reader to [Sch06, vR78, Tai75] for in-depth treatments of various aspects of analysis on local fields. Any result that we state without a proof or a bibliographic citation may be found in these references.
2.1. Basics
There is a distinguished real-valued mapping on which we denote by ; this map has the properties
[TABLE]
and takes the values , where for some prime and positive integer .
A map with properties (1.1)-(1.3) is called a non-Archimedean valuation. Property (1.3) is known as the ultrametric inequality or the strong triangle inequality. The mapping on is a metric on which induces the topology of . The metric space is a complete, totally disconnected, ultrametric space under this metric.
Write for the closed unit ball . Choosing so that , we have
[TABLE]
for each ; in particular, is both open and closed for each .
The set is a ring, called the ring of integers of . Each of the sets , , is a compact -submodule of , and every non-trivial compact -submodule of is of this form. For the additive quotient group has order . Consequently, is the union of disjoint translates (that is, cosets) of . Each of these cosets is, in turn, the union of disjoint translates of , and so on.
Remark 2.1*.*
We can thus think of the collection of balls contained in as being arranged in an infinite rooted -ary tree: the root is itself, the nodes at level are the balls of radius (cosets of ), and the “children” of such a ball are the cosets of that it contains. We can uniquely associate each point in with the sequence of balls that contain it, and so we can think of the points in as the boundary of this tree.
2.2. Norms
A norm on a vector space over the local field is a real-valued mapping on with the properties
[TABLE]
The norm induces a metric on by . The resulting metric space is an ultrametric space. We always take the norm on to be the one given by
[TABLE]
The space is complete under this metric. In general, a -Banach space is a vector space over that is equipped with a norm such that the resulting metric space is complete.
2.3. Orthogonality
The following definition of orthogonality in a normed vector space over mimics a characterization of orthogonality in an inner product space over that does not explicitly involve the inner product and instead is in terms of the induced metric.
Definition 2.2**.**
Given a subset of a normed vector space over , write for the linear span of . A subset of is -orthogonal if for all and ; that is, the best approximation of in the vector space is [math]. Equivalently, a subset is -orthogonal if for any finite subset and collection of scalars ,
[TABLE]
A subset of is -orthonormal if it is -orthogonal and for all .
Remark 2.3*.*
If is -orthonormal, then, for any finite subset and collection of scalars , Conversely, suppose that for any finite subset and collection of scalars we have Taking and for , , we see that for and hence that is, is -orthonormal.
Theorem 2.4**.**
The following are equivalent for an matrix .
- (i)
The matrix is an isometry of .
- (ii)
The matrix is invertible and the entries of both and lie in .
- (iii)
The columns of are -orthonormal.
- (iv)
The rows of are -orthonormal.
- (v)
The entries of lie in and .
Proof.
(i) (ii): See Remark 3.2 in [Eva02].
(i) (iii): Note that
[TABLE]
where we applied Remark 2.3 for the last equivalence.
(i) (iv): We have already shown that (i), (ii), and (iii) are equivalent. It remains to observe that is invertible with and both having entries in if and only if the transpose is invertible with and both having entries in .
(ii) (v): If (ii) holds, then it follows from the properties of the valuation that and , so that and (v) holds. Conversely, if (v) holds, then (ii) follows from Cramer’s rule. ∎
Notation 2.5**.**
In light of Theorem 2.4, we write for the group of matrices that satisfy the equivalent conditions of the theorem and say that these matrices are -orthogonal.
Notation 2.6**.**
For , denote the unit sphere by .
Lemma 2.7**.**
The group of matrices acts transitively on the set ; that is, given with , there is exists such that .
Proof.
Let , , be the coordinate vectors in ; that is is the vector with in the coordinate and [math] elsewhere. Because is a group, it suffices to show that for any with there exists a matrix such that .
Fix such an . Because , there is at least one coordinate, say , such that . Let be an matrix that has first column equal to and remain columns given by the vectors , , listed in some order. Note that . Clearly, so that and . ∎
2.4. Haar measure
There is a unique measure on which has the properties
[TABLE]
and
[TABLE]
for each , and
[TABLE]
the measure is just the suitably normalized Haar measure on the additive group of .
For , the measure on has the properties
[TABLE]
for each and
[TABLE]
for each matrix . In particular, is -invariant.
Notation 2.8**.**
Write for the restriction of the measure to and, for , set . Denote by the probability measure obtained by conditioning the probability measure on the set ; that is, is normalized to be a probability measure.
Proposition 2.9**.**
The probability measure is the unique probability measure on that is invariant under the action of .
Proof.
It is clear that is invariant under the action of . Because is a subset of that has positive measure (namely, ) that is invariant under the action of , it follows that is invariant under the action of . It therefore remains to establish the uniqueness claim. This, however, is immediate from Theorem 2.11 below and Lemma 2.7. ∎
Remark 2.10*.*
(i) The uniqueness claim in Proposition 2.9 may be established by an alternative route. Because is a compact, second countable, Hausdorff group, it possesses a Haar measure which is unique if we normalize it to be a probability measure. Denote this probability measure by . Suppose that is a -invariant probability measure on . For any positive Borel function on and we have and so . From Lemma 2.7 we know for each that there is a matrix such that , where . Therefore, by the invariance of , and hence is necessarily .
(ii) It is possible to describe the Haar measure on quite concretely. Let be the probability measure on matrices given by ; that is, if is distributed according to , then the entries of are independent and each is distributed according to . The Haar measure on is just conditioned on the set ; that is, the Haar measure is the restriction of to normalized to be a probability measure. Because we don’t need this result in what follows, we leave the (straightforward) proof to the reader. We note in passing that the measure of is – see [Eva02, Theorem 4.1].
(iii) As we have seen, the probability measure is -invariant. On the other hand, if the push-forward of by an matrix is again , then property (v) of Theorem 2.4 holds and . We may therefore add another equivalent property to the list in Theorem 2.4. A similar remark holds with replaced by .
For the sake of completeness, we state the following classical result on the existence and uniqueness of measures invariant under the action of a group (see, for example, [Kal02, Theorem 2.29]).
Theorem 2.11**.**
Let be a locally compact, second countable, Hausdorff group of measurable transformations on a locally compact, second countable, Hausdorff space . Suppose that acts properly (that is, the set is compact for all and compact ) and transitively (that is, given there exists such that ). Then, up to normalization, there is a unique non-zero -invariant Radon measure on .
The following corollary is an easy consequence of Proposition 2.9, but we include the proof for the sake of completeness.
Corollary 2.12**.**
Let be a -valued random variable such that has the same distribution as for all . Possibly on some extension of there is a -distributed, -valued random variable and a -valued random variable independent of such that . Consequently, if is a -invariant probability measure on , then is the push-forward of by the map for some probability measure on .
Proof.
Define events , , by and set . Let be equipped with the product of the -field and the Borel -field on . Let . Extend the definition of to by, with the usual abuse of notation, taking to be .
Set on the event , , and on the event . Put
[TABLE]
It is clear that . For any the conditional distribution of given the event is obviously invariant under the action of and so, by Proposition 2.9, the random variable is independent of with distribution . ∎
A classical theorem often attributed to Poincaré says that for fixed the distribution of the first coordinates of a point uniformly distributed over the sphere of radius in converges to the distribution of a vector of independent standard Gaussian variables as . See Section 6 of [DF87] for a discussion of the history of this result leading to the conclusion that a more appropriate attribution is to the work of Borel in [Bor06] (see also Chapter V of [Bor14]). It is shown in [DF87] that indeed the total variation distance between the distribution of the first coordinates of a point uniformly distributed over the sphere of radius in and the distribution of a vector of independent standard Gaussian variables converges to zero as provided that .
The analogue of such a result in the local field setting is the following. Note that the relevant total variation distance converges to zero as regardless of the relative size of with respect to .
Theorem 2.13**.**
For and , let be the push-forward of by the map that sends to . The total variation distance between the probability measures and is .
Proof.
Let have distribution and let have distribution so that has distribution and has distribution .
By definition, the distribution of is the same as the distribution of conditioned on the event . Write for the indicator of the event and for the indicator of the event . Let (resp. ) be conditioned on the event (resp. ); that is, (resp. ) is the conditional distribution of given the event (resp. ).
If , then the conditional distribution of given the event is the same as that of given the event ; both conditional distributions are . It follows that the total variation distance we seek is the same as the total variation distance between the distribution of and the distribution of . Furthermore, the conditional distribution of given the event is the same as the conditional distribution of given the event . Thus, it further suffices to compute the total variation distance between the distribution of and the distribution of .
Put and . Noting that the distribution of is the same as the conditional distribution of given the event , the total variation distance we seek is
[TABLE]
For we have
[TABLE]
and so
[TABLE]
Also,
[TABLE]
and
[TABLE]
so that
[TABLE]
Therefore,
[TABLE]
as claimed. ∎
2.5. Fourier theory
Recall that a character on a locally compact Abelian group with the group operation written additively is a map such that for all .
It is possible to fix a character for such that restricted to the subgroup is trivial (that is, always takes the value ) while restricted to the subgroup is non-trivial. Fixing any such choice of the character , an arbitrary character on is of the form for some . More generally, an arbitrary character on is of the form for some , where is the usual dot product of the vectors and .
A probability measure on has a Fourier transform , . For example, .
2.6. Gaussian random variables
For an introduction to the/an analogue of Gaussian probability measures on local fields and the proofs of any results stated without proof in this subsection, see [Eva89, Eva01].
The following definition mimics a standard definition/characterization of Gaussian random variables taking values in a Banach spaces over the real numbers.
Definition 2.14**.**
Let be a separable -Banach, and let be an -valued random variable. The random variable is -Gaussian if whenever and are independent copies of and are -orthonormal, then has the same distribution as .
Theorem 2.15**.**
Let be a separable -Banach space and let be an -valued random variable.
- (i)
The random variable is -Gaussian if and only if the distribution of is Haar measure on some compact -module of . In particular, if , then is -Gaussian if and only if the distribution of is either the point mass at [math] or the restriction of the Haar measure to one of the sets , , normalized to be a probability measure.
- (ii)
The random variable is -Gaussian if and only if the -valued random variable is -Gaussian for all , where is the dual space of continuous, linear maps from to .
- (iii)
If is -Gaussian, then the compact -module in (i) is the set
[TABLE]
where for a -valued random variable .
- (iv)
The random variable is -Gaussian if and only if is finite for all and, for all ,
[TABLE]
Definition 2.16**.**
A -valued random variable is standard -Gaussian if its distribution is .
3. Rotatable random sequences
The following analogue of Theorem 2 in [DF87] follows from a combination of Corollary 2.12 and Theorem 2.13.
Proposition 3.1**.**
Suppose that is a -invariant probability measure on so that is the push-forward of by the map for some probability measure on . The total variation distance between and the push-forward of by the map is at most .
Definition 3.2**.**
An infinite sequence of -valued random variables is rotatable if has the same distribution as for every and .
Theorem 3.3**.**
A -valued random infinite sequence is rotatable if and only if is almost surely finite and (possibly on some extension of ) , , where
- •
* are independent, identically distributed, standard -Gaussian random variables,*
- •
* is the random variable taking values in given by*
[TABLE]
- •
the random variable is independent of the sequence .
Proof.
For each , let
[TABLE]
and let be distributed according to and independent of . Observe that, by Corollary 2.12, has the same distribution a . Now let be an infinite sequence of independent, identically distributed, standard -Gaussian random variables independent of . Writing for the distribution of a random variable and for the total variation distance, we have
[TABLE]
which goes to [math] as by Theorem 2.13. So certainly converges in distribution to .
Now is increasing with , and
[TABLE]
so that almost surely for some random variable taking values in .
Thus has the same distribution as . The almost sure result then follows from the transfer lemma (Corollary 6.11 from [Kal02]); we thus have random variables such that has the same distribution as and almost surely. It remains to observe that
[TABLE]
almost surely, so that almost surely. ∎
There are several extensions and variants of Freedman’s theorem in the literature – see [DEL92] for a review. One extension is to consider infinite random sequences where the individual entries take values in for some or, more generally, in some separable Banach space over as in [Daw78]. The analogue of the result in [Daw78] holds in the local field setting, as we now explain. We say that a random sequence with entries in a separable -Banach space is rotatable if for any and we have that has the same distribution as . Write for the set of Gaussian probability measures on . The set is a closed subset of the Polish space of probability measures on the Polish space , where we equip the space of probability measures with the topology of weak converence. By Theorem 2.15, is in a bijective correspondence with the set of compact -modules in . The analytic content of Theorem 3.3 is that the distribution of a rotatable sequence of -valued random variables is for some probability measure on and it is not too difficult to establish the following generalization that is a local field counterpart to the result in [Daw78]. We omit the proof.
Theorem 3.4**.**
An infinite sequence of random variables in a separable -Banach space is rotatable if and only if the distribution of is for some probability measure on .
Other variants of Freedman’s theorem involve considering infinite random sequences such that for all the random vector has the same distribution as for all , where is some subgroup of the orthogonal group that contains the subgroup of permutation matrices. For example, [Smi81] considers the case where is the subgroup that fixes the vector and shows that in this case , where is an independent, identically distributed, standard Gaussian sequence and is an independent -valued random vector. It would be interesting to investigate the counterparts of such results in the local field setting, but we leave this to a future paper.
4. A local field analogue of Schoenberg’s theorem
In the case of real-valued rotatable random variables over , Freedman’s theorem is equivalent to a result about nonnegative definite functions (see Theorem 1.31 and Appendix 4 in [Kal05]) which we recalled in the Introduction. Here, we provide an analogue of the latter result in the local field setting.
Recall that a function defined on an Abelian group is nonnegative definite if for all , , and .
Theorem 4.1**.**
Let be such that . For , define by
[TABLE]
Then is nonnegative definite for every if and only if is nonnegative and nonincreasing.
Proof.
By Bochner’s theorem for locally compact groups (see Section 36 of [Loo53], also see [Wei40]), the function is nonnegative definite if there exists some -valued random variable such that for ; that is, is the characteristic function of . Using the definition of , this is
[TABLE]
We claim for and that has the same distribution as . Indeed, for ,
[TABLE]
and the uniqueness of characteristic functions completes the proof of the claim. By the Kolmogorov extension theorem, there exists an infinite -valued random sequence such that has the same distribution as for each .
We next claim that the random sequence is rotatable. Given and ,
[TABLE]
which shows that has the same distribution as .
Using Theorem 3.3, we have that almost surely, where is a sequence of independent, identically distributed, standard -Gaussian random variables. So is nonnegative definite for each if and only if for each . Thus
[TABLE]
This is equivalent to having the desired properties. ∎
Acknowledgments: We thank Persi Diaconis and an anonymous referee for a number of helpful suggestions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Bor 06] E. Borel, Sur les principes de la théorie cinétique des gaz , Annales scientifiques de l’É.N.S. 3 e superscript 3 𝑒 3^{e} serie 23 (1906), 9–32.
- 2[Bor 14] by same author, Introduction géométrique à quelques théories physiques , Gauthier-Villars, Paris, 1914.
- 3[Daw 78] A. P. Dawid, Extendibility of spherical matrix distributions , J. Multivariate Anal. 8 (1978), no. 4, 559–566. MR 520963
- 4[DEL 92] Persi W. Diaconis, Morris L. Eaton, and Steffen L. Lauritzen, Finite de Finetti theorems in linear models and multivariate analysis , Scand. J. Statist. 19 (1992), no. 4, 289–315. MR 1211786
- 5[DF 87] Persi Diaconis and David Freedman, A dozen de Finetti-style results in search of a theory , Ann. Inst. H. Poincaré Probab. Statist. 23 (1987), no. 2, suppl., 397–423. MR 898502
- 6[Eat 81] Morris L. Eaton, On the projections of isotropic distributions , Ann. Statist. 9 (1981), no. 2, 391–400. MR 606622
- 7[Eva 89] Steven N. Evans, Local field Gaussian measures , Seminar on Stochastic Processes, 1988 (Gainesville, FL, 1988), Progr. Probab., vol. 17, Birkhäuser Boston, Boston, MA, 1989, pp. 121–160. MR 990478
- 8[Eva 01] by same author, Local fields, Gaussian measures, and Brownian motions , Topics in probability and Lie groups: boundary theory, CRM Proc. Lecture Notes, vol. 28, Amer. Math. Soc., Providence, RI, 2001, pp. 11–50. MR 1832433
