Uniform spaces and the Newtonian structure of (big)data affinity kernels
Hugo Aimar, Ivana G\'omez

TL;DR
This paper demonstrates that under mild conditions, data affinity kernels can be represented as Newtonian potentials with a quasi-metric structure, revealing an underlying geometric framework for big data analysis.
Contribution
It establishes that affinity kernels satisfying specific conditions can be expressed as Newtonian potentials with an associated quasi-metric, linking data affinity to geometric structures.
Findings
Affinity kernels can be represented as Newtonian potentials.
The underlying space admits a quasi-metric structure.
A quantitative transitivity condition is established.
Abstract
Let be a (data) set. Let be a measure of the affinity between the data points and . We prove that has the structure of a Newtonian potential with decreasing and a quasi-metric on under two mild conditions on . The first is that the affinity of each to itself is infinite and that for the affinity is positive and finite. The second is a quantitative transitivity; if the affinity between and is larger than and the affinity of and is also larger than , then the affinity between and is larger than . The function is concave, increasing, continuous from onto with for every .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Harmonic Analysis Research · Nonlinear Partial Differential Equations · Geometric Analysis and Curvature Flows
Uniform spaces and the Newtonian structure of (big)data affinity kernels
Hugo Aimar
and
Ivana Gómez
Abstract.
Let be a (data) set. Let be a measure of the affinity between the data points and . We prove that has the structure of a Newtonian potential with decreasing and a quasi-metric on under two mild conditions on . The first is that the affinity of each to itself is infinite and that for the affinity is positive and finite. The second is a quantitative transitivity; if the affinity between and is larger than and the affinity of and is also larger than , then the affinity between and is larger than . The function is concave, increasing, continuous from onto with for every .
Key words and phrases:
uniform spaces; metric spaces; affinity kernel
2010 Mathematics Subject Classification:
Primary 54E15, 54E35
1. Introduction
Isaac Newton in the seventeenth century started the endless quantitative approach to the understanding of nature. The quantitative character of the formulation of the Law of Universal Gravitation, should not hide its deep qualitative aspects. Now, more than 300 years later, we are able to explain the central fields as gradients of radial potentials centered at the “object” generating the field. Usually the potential take the form where is a decreasing profile and is the distance of the point , where the field is to be measured, to the origin of coordinates supporting the mass or the charge that generates the field. In the Euclidean -dimensional space the profile gives the fundamental solution of the Laplacian. And the kernel provides the basic information in order to produce the continuous models by convolution with the mass or charge densities that determine the system. These facts are also the starting points for harmonic analysis.
Our aim in this paper is to use arguments and results strongly related to the theory of uniform spaces, in order to give sufficient conditions on a kernel function defined on an abstract setting, in such a way that with a decreasing profile and a (quasi)metric on .
In other words, we aim to obtain an abstract form of Newtonian potentials for general kernels. Our approach in the search of conditions on the kernel will be based in the heuristic associated to the interpretation of as an affinity matrix for (big) data. Amazingly enough the abstract of the paper [2] of the Coifman Group leading the harmonic analysis approach to determine the structure of big data sets, reads “The process of iterating or diffusing the Markov matrix is seen as a generalization of some aspects of the Newtonian paradigm, in which local infinitesimal transitions of a system lead to global macroscopic by integration.” In some sense the result of this paper shows that also the potential theoretic Newtonian view of nature is still close to these problems. By the way, our approach could be a good example of how qualitative aspects of a system leads to structural results of the model.
Let be a set (data set). Each element of is understood as a data point. Let be a nonnegative number measuring the affinity between the two data points and . We shall consider some basic properties of affinity which will be sufficient to obtain the Newtonian potential type structure for . Symmetry; affinity is a symmetric relation on ( for every , ). Positivity; there is positive affinity between any couple of data points and ( for every , ). Diagonal singularity; the self affinity is unimprovable. Precisely, the affinity of each data point with itself is ( for every ) but for the affinity is finite ( for ). Quantitative Transitivity; if the affinity between the data points and is larger than and the affinity between and is larger than then the affinity between the points and is larger than . Here is a nonnegative, concave, increasing and continuous function defined on onto such that .
A quasi-metric in is a nonnegative symmetric function defined on which vanishes only on the diagonal of and satisfies a weak form of the triangle inequality, there exists a constant () such that the inequality holds for every , and in .
The main result of this paper can be stated as follows.
Theorem 1**.**
Let be a set. Let be a symmetric function satisfying the singularity and the quantitative transitivity conditions. Then there exist a decreasing and continuous function defined in and a quasi-metric on such that
[TABLE]
Moreover, with a metric on and a symmetric function such that for some constants satisfies for every and in .
The deepest results on the structure of quasi-metrics are due to Macías and Segovia and are contained in [5]. See also [1]. The most significant for our purposes is the fact that each quasi-metric is equivalent to a power of a metric. In other words, given a quasi-metric on with constant , there exist and a metric on such that for some positive constants and the inequalities
[TABLE]
hold for every and in . Actually the proof is based in Frink’s lemma of metrization of uniform structures with a countable basis ([3], [4]).
The rest of the paper is organized in the following way. Section 2 gives a characterization of quasi-metrics on a set in terms of properties of the family of stripes in induced by the quasi-metric. Section 3 contains the construction of the monotonic profile. In Section 4 we prove the main result.
2. Quasi-metrics and families of stripes around the diagonal
Let be a set. The composition of two subsets and of is given by . A subset in is said to be symmetric if if and only if . Set to denote the diagonal in , i.e. . When a quasi-metric with constant is given in , it is easy to check that the one parameter family ; , of stripes around the diagonal of , satisfies
- (S1)
each is symmetric; 2. (S2)
, for every ; 3. (S3)
, for ; 4. (S4)
; 5. (S5)
; 6. (S6)
there exists such that , for every .
Actually, the constant in (S6) can be taken to be the triangle constant of . Set to denote the set of subsets of .
Theorem 2**.**
Let be a one parameter family of the subsets of that satisfies (S1) to (S6) above. Then the function defined on by is a quasi-metric on with . Moreover, for each , we have
[TABLE]
hold for every , where .
Proof.
From (S4) for the family we see that is well defined as a nonnegative real number. The symmetry of follows from (S1). The fact that vanishes on the diagonal follows from which is contained in (S2). Now, if and , then, from (S3) for each , . Now, from (S5) we necessarily have that or, in other words . Let us check that satisfies a triangle inequality. Let , and be three points in . Let . Take and such that , , and . From (S6) with , we have . Hence and we get the triangle inequality with . Notice first that from (S3), for every . Take now , then , so that for every and every . ∎
The next result follows from the above and the metrization theorem of quasi-metric spaces proved in [5] as an application of Frink’s Lemma on metrizability of uniform spaces with countable bases.
Theorem 3**.**
Let and be as in Theorem 2. Then, there exist a constant and a metric on such that
- (i)
; 2. (ii)
* where .*
Proof.
Following the proof of Theorem 2 on page 261 in [5] take such that and . With the metric provided by the metrization theorem for uniform spaces with countable bases, we have that . So that
[TABLE]
and (ii) follow from these inequalities and (2.1) with . ∎
3. Building the basic profile shape
The classical inverse proportionality to the square of the distance between the two bodies for the gravitation field, translates into inverse proportionality to the distance for the potential. That is for the gravitational potential.
This section is devoted to the construction of the basic shapes of the profiles that allow the Newtonian representation of the kernels. This construction requires to solve a functional inequality involving the function that controls the quantitative transitive property of .
Proposition 4**.**
Let be a concave, continuous, nonnegative and increasing function defined on onto such that for every . Then, given , there exists a continuous, decreasing and convex function defined on with such that the inequality
[TABLE]
holds for every .
Proof.
Set , and . Notice that and . In fact, and . Set for , and . Notice that decreases as and increases when . The continuity of and the property for every positive imply that as and as monotonically. This basic sequence allows to construct a function in the following way. Set , and for define by linear interpolation. It is clear that is continuous, decreasing, , , and that is convex. We only have to check that solves inequality (3.1). On the sequence , (3.1) becomes an equality. In fact, .
Let us now take for . For such , satisfies
[TABLE]
On the other hand, since , we have that . Hence satisfies
[TABLE]
[TABLE]
and
[TABLE]
Now, since is concave, we have for that
[TABLE]
hence, . ∎
The basic shapes for the profiles in our main result will be given as composition of the inverse of with power laws.
4. Proof of the main result
Let us start by rewriting, formally, the properties of symmetry, positivity, singularity and transitivity of a data affinity kernel defined on the set . Let such that
- (K1)
, for every and in ; 2. (K2)
, for every and in ; 3. (K3)
if and only if ; 4. (K4)
there exists a continuous, concave, increasing and nonnegative function defined on onto , with , , such that whenever there exists with and , for every .
With these properties, Theorem 1 can be restated as follows.
Theorem 5**.**
Let be a set. Let be a kernel on satisfying properties(K1) to (K4). Then, there exist a metric on , a real number , a function defined on with and a function continuous, decreasing with and , such that
[TABLE]
Proof.
Let be the function provided by (K4). Let be given by Proposition 4 with this function , and . Hence for every . Take and given by
[TABLE]
Let is check that satisfies properties (S1) to (S6) in Section 2 with constant (). From (K1) we see that each is symmetric, in particular is symmetric for every . Since , from (K3), we have that for . In order to check (S3) take . Hence , so that implies . Or, in other words . Or . The positivity (K2) of shows (S4). Property (S5) of follows from (S3). To prove (S6) for , take . If , then there exists such that and . From (K4), . Now applying (3.1) with we get , or . Hence (S6) for holds with . We can, then apply the results of Section 2. First to produce a quasi-metric as in Theorem 2 and then to provide the metric and the exponent given in Theorem 3. Thus, for every , , where is a metric in and, since can be taken to be equal to 2, works. The above inclusiones, taking , are equivalent to
[TABLE]
for every . Let and be two different points in . Since so is . There exists, then, a unique () such that . By the second inclusion in (4.1) we see that . On the other hand, since , from the first inclusion in (4.1) we necessarily have that . Hence for we have
[TABLE]
Set for and . Then and with . ∎
Notice that since is symmetric and bounded above and below by positive constants the function is a quasi-metric. But actually is better than a general quasi-metric since its triangular inequality constant can be taken to be independent of the length of chains. Precisely, .
Let us observe also that Newtonian type power laws are obtained when for . In fact, with , solves the equation . Hence becomes also a power law.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Hugo Aimar, Bibiana Iaffei, and Liliana Nitti. On the Macías-Segovia metrization of quasi-metric spaces. Rev. Un. Mat. Argentina , 41(2):67–75, 1998.
- 2[2] R. R. Coifman, S. Lafon, A. B. Lee, M. Maggioni, B. Nadler, F. Warner, and S. W. Zucker. Geometric diffusions as a tool for harmonic analysis and structure definition of data: Diffusion maps. Proceedings of the National Academy of Sciences of the United States of America , 102(21):7426–7431, 2005.
- 3[3] A. H. Frink. Distance functions and the metrization problem. Bull. Amer. Math. Soc. , 43(2):133–142, 1937.
- 4[4] John L. Kelley. General topology . Springer-Verlag, New York-Berlin, 1975. Reprint of the 1955 edition [Van Nostrand, Toronto, Ont.], Graduate Texts in Mathematics, No. 27.
- 5[5] Roberto A. Macías and Carlos Segovia. Lipschitz functions on spaces of homogeneous type. Adv. in Math. , 33(3):257–270, 1979.
